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Description 
NOVEL CYSTINE KNOT PROTEIN AND 
MATERIALS AND METHODS FOR MAKING IT 

BACKGROUND OF THE INVENTION 

In multicellular animals, cell growth, differentiation, and migration 
are controlled by polypeptide growth factors and hormones. These growth 
factors play a role in both normal development and pathogenesis, including the 
development of solid tumors. 

Polypeptide growth factors and homnones influence cellular 
events by binding to cell-surface receptors. Binding initiates a chain of 
signalling events within the cell, which ultimately results in phenotypic changes 
such as cell division and production of additional hormones. 

One family of hormones is the glycoprotein hormone family, which 
includes luteinizing hormone, follicle-stimulating honnone, thyroid-stimulating 
hormone, and chorionic gonadotropin. The first three are synthesized in the 
anterior pituitary, while chorionic gonadotropin is synthesized in the placenta, 
reaching a maximum at 10-12 weeks after conception and -declining thereafter 
to the end of pregnancy. 

The four glycoprotein homiones are structurally and functionally 
related. All four are glycosylated and consist of two non-covalently associated 
subunits, term a and p subunits. A single a subunit is common to all four 
hormones, while the p subunits are unique and confer biological specificity. 
The different p subunits are of similar size and have a significant degree of 
pairwise homology; the p subunits of human chorionic gonadotropin (HCG) and 
human luteinizing hormone are 82% identical, and the other pairs of p subunits 
are about 30-40% identical. Twelve cysteine residues are conserved among 
the four p subunits. The common a subunit exhibits detectable homology to 
the p subunits and includes six of the twelve conserved cysteine residues. 
See, Fiddes and Goodman, Nature 281:351-356, 1979; Fiddes and Goodman, 
Nature 286:684-687. 1980; Talmadge et al.. Nature 307:37-40, 1984; and 
Pi. rce and Parsons, Ann. Rev. Biochem. 50:465-495, 1981. The polypeptides 
form characteristic higher-order structures having a bow tie-like configuration 
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about a cystine knot, formed by disulfide bonding between three pairs of 
cysteine residues. Dimerization occurs through hydrophobic interactions 
between |oops of the two monomers. See. Daopin et al., Science 257:369. 
1992; Lapthorn et al.. A/afure 369:455, 1994. 

The cystine l<not motif and bow tie-like fold are also characteristic 
of the growth factors transfomiing growth factor-beta (TGF-p). nerve growth 
factor (NGF). and platelet derived growth factor (PDGF). These proteins are all 
dimers in their active fonms, the monomer subunits of which contain from 100 to 
130 amino acid residues. Although their amino acid sequences are quite 
divergent, these proteins, as well as the glycoprotein hormones, ail contain the 
six conserved cysteine residues of the cystine knot. 

The glycoprotein honnones act in a stage- and tissue-specific 
manner. LH, FSH, and TSH are produced in the pituitary. Luteinizing homaone 
stimulates steroid production in the testes and ovaries, which in turn stimulates 
spermatogenesis and ovulation. FSH is also a regulator of gametogenesis and 
steroid hormone synthesis in the gonads. TSH regulates a variety of processes 
in the thyroid, thereby controlling synthesis and secretion of thyroid hormones. 
HCG. produced in placenta, stimulates the ovaries to produce steroids that are 
necessary for the maintenance of pregnancy. For review see Pierce and 
Parsons. Ann. Rev, Biochem. 50:465-495, 1981. 

A more recently discovered member of this family, designated 
Norrie disease protein (NDP), is believed to be a regulator of neural cell 
differentiation and proliferation (Berger et al.. Nature Genetics 1:199-203. 
1992). NDP is expressed in retina, choroid, and fetal and adult brain. A lack of 
functional NDP is associated with Norrie disease, an X-linked disorder 
characterized by blindness, deafness, and mental disturbances. A number of 
variant forms of the protein, including deletions and point mutations, have been 
identified in Norrie disease patients. See, for example. Berger et al., ibid,\ 
Fuchs et al, Hum. Moi Genet 3:655-656. 1994; and Meindl et al.. Nature 
Genef/cs 2:139-143. 1992. 

Another group of related proteins is the growth and differentiation 
factors (GDFs). One member of this group, known as GDF-8 or myostatin, 
appears to act as a negative regulator of muscle mass (McPherron and Lee, 
Nature 387:83-90, 1997; McPherron and Lee. Proc. Natl. Acad ScL USA 
94:12457-12461, 1997; and Grobet et al,. Nat Genet 17:17-71, 1997). Many 
GDFs share 20-40% sequence homology with each other and with TGF-p 1, 2, 
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and 3. The discovery of the GDFs supports the postulated existence of 
"chalones", soluble factors hypothesized to control organ size and regeneration 
(Bullough. Cancer Res, 25:1683-1727, 1965; Bullough. BioL Res. 37:307-342. 
1992). 

The role of homnones in controlling cellular processes makes 
them likely candidates and targets for therapeutic intervention. Examples of 
such proteins that are used therapeutically include insulin for the treatment of 
diabetes and erythropoietin for the treatment of anemia. Gonadotropin has 
been used to induce ovulation (e.g., Fleming, Am, 1 ObsteL Gynecol. 159:376- 
381, 1988), to induce scrotal descent of cryptorchid testes (Lala et a!., J. Urol. 
157:1898-1901, 1997), and to stimulate intratesticular testosterone production 
in men who have undergone varicocelectomy (Yamamoto et a!., Arch. AndroL 
35:49-55, 1995). Clinical studies have shown that hCG can have antitumor 
activity against Kaposi's sarcoma (Gill et al., J, Nati Cancer Inst 89:1797- 
1802, 1997). Assays for the presence of chorionic gonadotropin are used to 
detect pregnancy. Vaccines against hCG have shown promising results in 
early tests for preventing pregnancy and inhibiting the growth of hormone- 
dependent cancers (Talwar, Immunol. Cell BioL 75:184-189, 1997). 

In view of the proven clinical utility of hormones, there is a need in 
the art for additional such molecules for use as therapeutic agents, diagnostic 
agents, and research tools and reagents. 

The present invention provides such polypeptides for these and 
other uses that should be apparent to those skilled in the art from the teachings 
herein. 

SUMMARY OF THE INVENTION 

Within one aspect of the invention there is provided an isolated 
polypeptide that is at least 80% identical in amino acid sequence to residues 1 
through 106 of SEQ ID N0:2. The polypeptide comprises cysteine residues at 
positions corresponding to residues 8, 34, 38, 66, 96, and 98 of SEQ ID N0:2, 
a glycine residue at a position corresponding to residue 36 of SEQ ID NO:2, 
and beta strands con-esponding to residues 9-17, 29-34, 38-43. 59-64, 67-71, 
and 90-95 of SEQ ID N0:2. Within one embodiment of the invention the 
isolated polypeptide further comprises cysteine residues at positions 
con-esponding to residues 25. 65, 80. and 101 of SEQ ID N0:2. Within a 
further embodiment, amino acid residues within the polypeptide at positions 
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corresponding to residues 8. 11, 12, 14, 29, 30, 32, 34, 43. 44. 60. 63. 64. 65. 
71. 74, 80, 90. 91, 93. and 94 of SEQ ID N0:2 are Cys. His. Pro, Asn. His, Val, 
Gin, Cys. Phe. Pro, Thr. Ser. Gin. Cys, Leu. Val. Cys, lie, Phe. Ala, and Arg. 
respectively, and an amino acid residue con'esponding to residue 75 of SEQ ID 
5 N0:2 Is Lys or Arg. Within other embodiments the isolated polypeptide 
comprises residue 1 through residue 106 of SEQ ID N0:2 or residue 1 through 
residue 106 of SEQ ID NO:29. Within further embodiments, the isolated 
polypeptide is covalently linked to an affinity tag or to an immunoglobulin 
constant region. 

10 Within a second aspect of the invention there is provided an 

isolated protein comprising a first polypeptide complexed to a second 
polypeptide wherein said protein modulates cell proliferation, differentiation, or 
metabolism. The first polypeptide is at least 80% identical in amino acid 
sequence to residues 1 through 106 of SEQ ID N0:2 and comprises cysteine 

15 residues at positions con-esponding to residues 8, 34, 38, 66. 96. and 98 of 
SEQ ID N0:2. a glycine residue at a position corresponding to residue 36 of 
SEQ ID N0:2. and beta strands corresponding to residuiss 9-17, 29-34, 38-43, 
59-64, 67-71, and 90-95 of SEQ ID N0:2. Within one embodiment the first 
polypeptide furttier comprises cysteine residues at positions con^esponding to 

2 0 residues 25, 65, 80, and 101 of SEQ ID N0:2. Within a further embodiment, 
amino acid residues of the first polypeptide corresponding to residues 8, 11. 12. 
14, 29. 30. 32, 34, 43, 44, 60. 63, 64. 65, 71, 74, 80, 90, 91. 93. and 94 of SEQ 
ID N0:2 are Cys, His, Pro, Asn, His, Val. Gin, Cys, Phe, Pro. Thr. Ser. Gin, 
Cys, Leu, Val. Cys. lie. Phe, Ala, and Arg. respectively; and an amino acid 

25 residue con^esponding to residue 75 of SEQ ID N0:2 is Lys or Arg. Within 
another embodiment the protein is a heterodimer. Within a related embodiment 
the second polypeptide is a glycoprotein hormone common alpha subunit. 
Within other embodiments the first polypeptide comprises residue 1 through 
residue 106 of SEQ ID N0:2 or residue 1 through residue 106 of SEQ ID 

30 NO:29. Within further embodiments, the protein is a homodimer, such as a 
homodimer of polypeptides comprising residue 1 through residue 106 of SEQ 
ID N0:2. or a homodimer of polypeptides comprising residue 1 through residue 
106 of SEQ IDNO:29. 

Within a third aspect of the invention there is provided an isolated 

35 polynucleotide encoding a polypeptide that is at least 90% identical in amino 
acid sequence to residues 1 through 106 of SEQ ID N0:2, wherein the 
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polypeptide comprises cysteine residues at positions corresponding to residues 
8. 34, 38, 66, 96, and 98 of SEQ ID N0:2, a glycine residue at a position 
corresponding to residue 36 of SEQ ID N0:2, and beta strands con-esponding 
to residues 9-17, 29-34. 38-43, 59-64. 67-71. and 90-95 of SEQ ID N0:2. 
Within one embodiment the polypeptide further comprises cysteine residues at 
positions coresponding to residues 25, 65, 80, and 101 of SEQ ID N0:2. 
Within another embodiment, amino acid residues of the polypeptide 
con-esponding to residues 8, 11, 12, 14. 29, 30, 32, 34, 43, 44, 60, 63. 64, 65, 
71, 74, 80, 90, 91. 93, and 94 of SEQ ID N0:2 are Cys, His. Pro, Asn. His, Val, 
Gin, Cys. Phe, Pro, Thr, Ser, Gin. Cys. Leu, Val, Cys. lie, Phe. Ala, and Arg, 
respectively, and an amino acid residue corresponding to residue 75 of SEQ ID 
N0:2 is Lys or Arg. Within certain additional embodiments of the invention, the 
polypeptide comprises residue 1 through residue 106 of SEQ ID N0:2 or 
residue 1 through residue 106 of SEQ ID NO:29. Within another embodiment 
the polynucleotide further encodes a secretory peptide operably linked to the 
polypeptide. Within additional embodiments the polynucleotide encodes 
residue -23 through residue 106 of SEQ ID N0:2 or residue -23 through 
residue 106 of SEQ ID NO:29. Within further embodiments the polynucleotide 
comprises a sequence of nucleotides as shown in SEQ ID N0:4 or SEQ ID 
NO:30 from nucleotide 70 through nucleotide 387. Within other embodiments, 
the polynucleotide comprises a sequence of nucleotides as shown in SEQ ID 
N0:1 from nucleotide 125 through nucleotide 442. Within an additional ^ 
embodiment the polynucleotide is from 318 to 1000 nucleotides in length. The 
polynucleotide can be DNA or RNA, 

Within a fourth aspect of the invention there is provided an 
expression vector comprising the following operably linked elements: (a) a 
transcription promoter; (b) a DNA segment encoding a polypeptide as disclosed 
above; and (c) a transcription terminator. Within one embodiment the DNA 
segment further encodes a secretory peptide operably linked to the 
polypeptide. Within further embodiments the DNA segment encodes residue - 
23 through residue 106 of SEQ ID N0:2 or residue -23 through residue 106 of 
SEQ ID NO:29. 

Within a fifth aspect of the invention there is provided a cultured 
cell into which has been introduced an expression vector as disclosed above, 
wherein the cell expresses the polypeptide encoded by the DNA segment. The 
cell can be used within a method of producing a polypeptide, wherein the 
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method comprises culturing the cell, whereby the cell expresses the 
polypeptide encoded by the DNA segment, and recovering the xpr ssed 
polypeptide. 

Within a further aspect of the invention there is provided an 
5 antibody that specifically binds to an epitope of a polypeptide as disclosed 
above. 

The invention also provides a method for detecting a genetic 
abnonnality in a patient, comprising the steps of (a) obtaining a genetic sample 
from a patient; (b) incubating the genetic sample with a polynucleotide 
10 comprising at least 14 contiguous nucleotides of SEQ ID N0:1 or the 
complement of SEQ ID N0:1, under conditions wherein the polynucleotide will 
hybridize to complementary polynucleotide sequence, to produce a first 
reaction product; (c) comparing the first reaction product to a control reaction 
product, wherein a difference between the first reaction product and the control 
15 reaction product is indicative of a genetic abnormality in the patient 

The invention also provides an oligonucleotide probe or primer 
. comprising 14 contiguous nucleotides of a polynucleotide of SEQ ID N0:4 or a 
sequence complementary to SEQ ID N0:4. Within one embodiment the probe 
or primer comprises 14 contiguous nucleotides of a polynucleotide of SEQ ID 
20 NO:1 or a sequence complementary to SEQ ID N0:1. 

The invention also provides a pharmaceutical composition 
comprising a polypeptide as disclosed above in combination with a 
pharmaceutically acceptable vehicle. 

These and other aspects of the invention will become evident 
25 upon reference to the following detailed description of the invention and the 
attached drawings. 

BRIEF DESCRIPTION OF THE DRAWING 

The figure is a Hopp/Woods hydrophilicity profile of the amino 
30 acid sequence shown in SEQ ID N0:2. The profile is based on a sliding six- 
residue window. Buried G, S, and T residues and exposed H. Y, and W 
residues were ignored. These residues are indicated in the figure by lower 
case letters. 
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DETAILED DESCRIPTION OF THE INVENTION 

Prior to setting forth the invention in detail, it may be helpful to the 
understanding thereof to define the following ternis: 

The term "affinity tag" is used herein to denote a polypeptide 
5 segment that can be attached to a second polypeptide to provide for 
purification of the second polypeptide or provide sites for attachment of the 
second polypeptide to a substrate. In principal, any peptide or protein for which 
an antibody or other specific binding agent is available can be used as an 
affinity tag. Affinity tags include a poly-histidine tract, protein A (Nilsson et al., 

10 EMBO J, 4:1075, 1985; Nilsson et al., Methods Enzymol. 198:3. 1991). 
glutathione S transferase (Smith and Johnson, Gene 67:31. 1988). Glu-Glu 
affinity tag (Grussenmeyer et al.. Proc. NatL Acad ScL USA 82:7952-7954, 
1985). substance P. Flag*^^ peptide (Hopp et al., B/btecWogy 6:1204-1210, 
1988), maltose binding protein (Kellemnan and Ferenci, Methods Enzymol. 

15 90:459-463, 1982; Guan et al., Gene 67:21-30, 1987). streptavidin binding 
peptide, thioredoxin, ubiquitin, cellulose binding protein, T7 polymerase, 
immunoglobulin constant domain, or other antigenic epitope or binding domain. 
See, in general, Ford et al., Protein Expression and Purification 2: 95-107, 
1991. DNAs encoding affinity tags and otehr reagents are available from 

20 commercial suppliers (e.g., Pharmacia Biotech. Piscataway. NJ; Eastman 
Kodak. New Haven, CT; New England Biolabs, Beverly, MA). 

The term "allelic variant" is used herein to denote any of two or 
more alternative forms of a gene occupying the same chromosomal locus. 
Allelic variation arises naturally through mutation, and may result in phenotypic 

25 polymorphism within populations. Gene mutations can be silent (no change in 
the encoded polypeptide) or may encode polypeptides having altered amino 
acid sequence. The term allelic variant is also used herein to denote a protein 
encoded by an allelic variant of a gene. 

-The terms "amino-terminal" and "carboxyl-temninal" are used 

30 herein to denote positions within polypeptides. Where the context allows, these 
tenms are used with reference to a particular sequence or portion of a 
polypeptide to denote proximity or relative position. For example, a certain 
sequence positioned carboxyl-terminal to a reference sequence within a 
polypeptide is located proximal to the carboxyl terminus of the reference 

35 sequence, but is not necessarily at the carboxyl tenninus of the complete 
polypeptide. 
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A "beta-strand-like region" is a region of a protein characterized 
by certain combinations of th polypeptide backbone dihedral angles phi {^) 
and psi (v). Regions wherein ^ is less than -60** and v|/ is greater than 90** are 
beta-strand-like. Those skilled in the art will recognize that the limits of a p- 
strand are somewhat imprecise and may vary with the criteria used to define 
them. See, for example. Richardson and Richardson in Fasman, ed., 
Prediction of Protein Structure and the Principles of Protein Conformation. 
Plenum Press. New Yori<. 1989; and Lesk, Protein Architecture: A Practical 
Approach . Oxford University Press, New Yori<, 1991. 

A "complement" of a polynucleotide molecule is a polynucleotide 
molecule having a complementary base sequence and reverse orientation as 
compared to a reference sequence. For example, the sequence 5' 
ATGCACGGG 3' is complementary to 5' CCCGTGCAT 3*. 

A plurality of polypeptide chains are "complexed with" each other 
when they are associated, covalently (e.g., by disulfide bonding) or non- 
covalently (e.g., by hydrogen bonding, hydrophobic interactions, or salt-bridge 
interactions), to form a protein having a characteristic biological activity. 

"Con^esponding to", when used in reference to a nucleotide or 
amino acid sequence, indicates the position in a second sequence that aligns 
with the reference position when two sequences are optimally aligned. 

The term "degenerate nucleotide sequence" denotes a sequence 
of nucleotides that includes one or more degenerate codons (as compared to a 
reference polynucleotide molecule that encodes a polypeptide). Degenerate 
codons contain different triplets of nucleotides, but encode the same amino 
acid residue (i.e., GAU and GAC triplets each encode Asp). 

The term "expression vector" is used to denote a DNA molecule, 
linear or circular, that comprises a segment encoding a polypeptide of interest 
operably linked to additional segments that provide for its transcription. Such 
additional segments include promoter and terminator sequences, and may also 
include one or more origins of replication, one or more selectable markers, an 
enhancer, a polyadenylation signal, etc. Expression vectors are generally 
derived from plasmid or viral DNA. or may contain elements of both. 

The term "isolated", when applied to a polynucleotide, denotes 
that the polynucleotide has been removed from its natural genetic milieu and is 
thus free of other extraneous or unwanted coding sequences, and is in a forni 
suitable for use within genetically engineered protein production systems. 
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Such isolated molecul s are those that are separated from their natural 
environment and include cDNA and genomic clones. Isolated DNA molecules 
of the present invention are free of other genes with which they are ordinarily 
associated, but may include naturally occuning 5' and 3* untranslated regions 
such as promoters and temninators. The identification of associated regions 
will be evident to one of ordinary skill in the art (see for example, Dynan and 
Tijan, A/afty/e 316:774-78, 1985). 

An "isolated" polypeptide or protein is a polypeptide or protein that 
is found in a condition other than its native environment, such as apart from 
blood and animal tissue. In a preferred fomi, the isolated polypeptide or protein 
is substantially free of other polypeptides or proteins, particularly other 
polypeptides or proteins of animal origin. It is prefen-ed to provide the 
polypeptides or proteins in a highly purified forni, i.e. greater than 95% pure, 
more preferably greater than 99% pure. When used in this context, the temi 
"isolated" does not exclude the presence of the same polypeptide or protein in 
alternative physical fomns. such as dimers or alternatively glycosylated or 
derivatized forms. 

"Operably linked", when refening to DNA segments, Indicates that 
the segments are arranged so that they function in concert for their intended 
purposes, e.g., transcription initiates in the promoter and proceeds through the 
coding segment to the tenminator. 

The temi "orthologV denotes a polypeptide or protein obtained 
from one species that is the functional counterpart of a polypeptide or protein 
from a different species. Sequence differences among orthologs are the result 
of speciation. 

A "polynucleotide" is a single- or double-stranded polymer of 
deoxyribonucleotide or ribonucleotide bases read from the 5' to the 3' end. 
Polynucleotides include RNA and DNA, and may be isolated from natural 
sources, synthesized in vitro, or prepared from a combination of natural and 
synthetic molecules. Sizes of polynucleotides are expressed as base pairs 
(abbreviated "bp"), nucleotides ("nt"). or kilobases ("kb"). Where the context 
allows, the latter two tenms may describe polynucleotides that are single- 
stranded or double-stranded. When the tenn is applied to double-stranded 
molecules it is used to denote overall length and will be understood to be 
equivalent to the term "base pairs". It will be recognized by those skilled in the 
art that the two strands of a double-stranded polynucleotide may differ slightly 
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in length and that the ends thereof may be staggered as a result of enzymatic 
cleavage; thus all nucleotides within a double-stranded polynucleotide molecule 
may not be paired. Such unpaired ends will in general not exceed 20 nt in 
length. 

A "polypeptide" is a polymer of amino acid residues joined by 
peptide bonds, whether produced naturally or synthetically. Polypeptides of 
less than about 10 amino acid residues are commonly referred to as "peptides". 

The term "promoter" is used herein for its art-recognized meaning 
to denote a portion of a gene containing DNA sequences that provide for the 
binding of RNA polymerase and initiation of transcription. Promoter sequences 
are commonly, but not always, found in the 5* non-coding regions of genes, 

A "protein" is a macromolecule comprising one or more 
polypeptide chains. A protein may also comprise non-peptidic components, 
such as carbohydrate groups. CariDohydrates and other non-peptidic 
substituents may be added to a protein by the cell in which the protein is 
produced, and will vary with the type of cell. Proteins are defined herein in 
temis of tiieir amino acid backbone structures; substituents such as 
carbohydrate groups are generally not specified, but may be present 
nonetheless. 

A "secretory signal sequence" is a DNA sequence that encodes a 
polypeptide (a "secretory peptide") that, as a component of a larger 
polypeptide, directs the larger polypeptide through a secretory pathway of a cell 
in which it is synthesized. The larger polypeptide is commonly cleaved to 
remove the secretory peptide during transit through the secretory pathway. 

Molecular weights and lengths of polymers determined by 
imprecise analytical methods (e.g., gel electrophoresis) will be understood to 
be approximate values. When such a value is expressed as "abouf X or 
"approximately" X. the stated value of X will be understood to be accurate to 
±10%. 

All references cited herein are incorporated by reference in their 

entirety. 

The present invention is based in part upon the discovery of a 
novel DNA molecule that encodes a polypeptide having a secretory peptide 
and an an*angement of cysteine residues and beta strand-like regions that is 
characteristic of the gonadotropin family of glycoprotein hormones. The DNA 
mol cule was originally identified in a library of cDNAs derived from pancreatic 
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Strand 3 variable loop 2 ^ beta strand 4 ^ cystine knot ^ beta strand 5 ^ 
variable loop 3 -> beta strand 6 cystine knot. Variable loop 1 is disulfid 
bonded to variable loop 2 to form one side of the bow tie. with variable loop 3 
fomiing the other side. 

Structural analysis and homology indicate that zsig51 
polypeptides complex with a second polypeptide to form multimeric proteins. 
These proteins include homodimers and heterodimers. In the latter case, the 
second polypeptide can be a truncated or other variant zsig51 polypeptide or 
another polypeptide, such as a glycoprotein homione subunit, TGF-(J 
polypeptide, a GDF polypeptide, or a bone morphogenic protein (BMP) 
polypeptide. Among the dimeric proteins within the present invention are 
dimers formed by non-covalent association (e.g., hydrophobic interactions) with 
a second subunit, either a second zsigSI polypeptide or other second subunit, 
such as a common a subunit. Within these dimers, loops 1 and 3 of monomer 
1 interact with loop 2 of monomer 2. and loop 2 of monomer 1 interacts with 
loops 1 and 3 of monomer 2. In addition, dimerization may occur via 
intemiolecular disulfide bond formation. Alignment with TGF-p indicates that 
Cys-65 may participate in an intermolecular disulfide bond. 

The present invention also provides isolated polypeptides that are 
substantially homologous to the polypeptides of SEQ ID N0:2 and their 
orthologs. Such polypeptides will preferably be at least 95% or more identical 
to residues 1-106 of SEQ ID N0;2 or its orthologs. Percent sequence identity 
is determined by conventional methods. See. for example, Altschul et al., Bull. 
Math. Bio. 48:603-616. 1986. and Henikoff and Henikoff. Proc. Natl. Acad ScL 
USA 89:10915-10919, 1992. Briefly, two amino acid sequences are aligned to 
optimize the alignment scores using a gap opening penalty of 10. a gap 
extension penalty of 1. and the "BLOSUM62" scoring matrix of Henikoff and 
Henikoff (ibid,) as shown in Table 1 (amino acids are indicated by the standard 
one-letter codes). The percent identity is then calculated as: 

Total number of identical matches 
xlOO 

[length of the longer sequence plus the 

number of gaps introduced into the longer 

sequence in order to align the two 

sequences] 



I 
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The level of identity between amino acid sequ nces can be 
determined using the "FASTA" similarity search algorithm of Pearson and 
Lipman {Pmc. Natl, Acad, Sci. USA 85:2444. 1988). and by Pearson {Meth. 
5 EnzymoL 183:63, 1990). Briefly. FASTA first characterizes sequence 
similarity by identifying regions shared by the query sequence (e.g., SEQ ID 
N0:2) and a test sequence that have either the highest density of identities (if 
the ktup variable is 1) or pairs of identities (if ktup=2). without considering 
conservative amino acid substitutions, insertions, or deletions. The ten 

10 regions with the highest density of identities are then rescored by comparing 
the similarity of all paired amino acids using an amino acid substitution 
matrix, and the ends of the regions are "trimmed" to include only those 
residues that contribute to the highest score. If there are several regions with 
scores greater than the "cutoff value (calculated by a predetermined formula 

15 based upon the length of the sequence and the ktup value), then the trimmed 
initial regions are examined to detemiine whether the regions can be joined 
to form an approximate alignment with gaps. Finally, the highest scoring 
regions of the two amino acid sequences are aligned using a modification of 
the Needleman-Wunsch-Sellers algorithm (Needleman and Wunsch. J. MoL 

20 Biol. 48:444, 1970; Sellers. SIAM J. AppL Math. 26:787, 1974), which allows 
for amino acid insertions and deletions. Illustrative parameters for FASTA 
analysis are: ktup=1, gap opening penalty=10, gap extension penalty=1, and 
substitution matrix=BLOSUM62. These parameters can be introduced into a 
FASTA program by modifying the scoring matrix file ("SMATRIX"), as 

25 explained in Appendix 2 of Pearson. 1990 {ibid.). 

FASTA can also be used to determine the sequence identity of 
nucleic acid molecules using a ratio as disclosed above. For nucleotide 
sequence comparisons, the ktup value can range between one to six. 
preferably from four to six. 

3 0 The present invention includes polypeptides having one or 

more conservative amino acid changes as compared with the amino acid 
sequence of SEQ ID N0:2. The BLOSUM62 matrix (Table 1) is an amino 
acid substitution matrix derived from about 2,000 local multiple alignments of 
protein sequence segments, representing highly conserved regions of more 

35 than 500 groups of related proteins (Henikoff and Henikoff. ibid.). Thus, the 
BLOSUM62 substitution frequencies can be used to define conservative 
amino acid substitutions that may be introduced into the amino acid 
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sequ nces of the present invention. As used herein, the term "conservative 
amino acid substitution" refers to a substitution represented by a BLOSUM62 
value of greater than -1. For example, an amino acid substitution is 
consen/ative if the substitution is characterized by a BLOSUM62 value of 0, 
5 1 , 2, or 3. Preferred conservative amino acid substitutions are characterized 
by a BLOSUM62 value of at least one 1 (e.g., 1. 2 or 3), while more prefen^ed 
conservative amino acid substitutions are characterized by a BLOSUM62 
value of at least 2 (e.g., 2 or 3). 

Substantially homologous proteins and polypeptides are 

10 characterized as having one or more amino acid substitutions, deletions or 
additions. These changes are preferably of a minor nature, that is 
conservative amino acid substitutions and other changes that do not 
significantly affect the folding or activity of the protein or polypeptide, and 
include amino- or carboxyl-tenninal extensions, such as an amino-temninal 

15 methionine residue, a small linker peptide of up to about 20-25 residues, or 
an extension that facilitates purification (an affinity tag) as disclosed above. 
Proteins having such an extension will preferably comprise a region that is at 
least 95% identical to residues 1 through 106 of SEQ ID N0:2. or an ortholog 
thereof. 

20 The present invention further provides a variety of other 

polypeptide fusions and related multimeric proteins comprising one or more 
polypeptide fusions. For example, a zsig51 polypeptide can be prepared as 
a fusion to a dimerizing protein as disclosed in U.S. Patents Nos. 5,155.027 
and 5,567,584. Prefenred dimerizing proteins in this regard include 

25 immunoglobulin constant region domains. lmmunoglobulin-zsig51 
polypeptide fusions can be expressed in genetically engineered cells to 
produce a variety of multimeric zsigSI analogs. Auxiliary domains can be 
fused to zsigSI polypeptides to target them to specific cells, tissues, or 
macromolecules (e.g.. collagen). For example, a zsig51 polypeptide or 

30 protein can be targeted to a predetermined cell type by fusing a zsig51 
polypeptide to a ligand that specifically binds to a receptor on the surface of 
the target cell. In this way. polypeptides and proteins can be targeted for 
therapeutic or diagnostic purposes. A zsig51 polypeptide can be fused to 
two or more moieties, such as an affinity tag for purification and a targeting 

3 5 domain. Polypeptide fusions can also comprise one or more cleavage sites, 
particularly between domains. See. Tuan et al.. Connective Tissue Research 
34:1-9. 1996. 
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The polypeptides of the present invention can also comprise 
non-naturally occurring amino acid residues. Non-naturally occuning amino 
acids include, without limitation, frans-3-methylproline. 2.4-methanoproline, 
c/5-4-hydrbxyproline, frans-4-hydroxyproline, A/-methylglycine, a//o-threonine, 
5 methylthreonine, hydroxyethylcysteine. hydroxyethylhomocysteine. 
nitroglutamine, homoglutamine, pipecolic acid, tert-leucine. norvaiine, 2- 
azaphenylalanine, 3-azaphenylalanine. 4-a2aphenylalanine, and 4- 
fluorophenyialanine. Several methods are known in the art for incorporating 
non-naturally occurring amino acid residues into proteins. For example, an in 

10 vitro system can be employed wherein nonsense mutations are suppressed 
using chemically aminoacylated suppressor tRNAs, an £ coli S30 extract, 
and commercially available enzymes and other reagents. See. for example. 
Robertson et al., J. Am, Cfiem. Soc. 113:2722. 1991; Ellman et al.. Methods 
Enzymoi 202:301, 1991; Chung et a!., Science 259:806-809. 1993; and 

15 Chung et al., Proc. Natl. Acad. Sci. USA 90:10145-10149. 1993). In a 
second method, translation is canried out in Xenopus oocytes by 
microinjection of mutated mRNA and chemically aminoacylated suppressor 
tRNAs (Turcatti et al.. J. Biol. Chem. 271:19991-19998. 1996). Within a third 
method, E. coli cells are cultured in the absence of a natural amino acid that 

20 is to be replaced (e.g., phenylalanine) and in the presence of the desired 
non-naturally occurring amino acid(s) (e.g., 2-azaphenylalanine. 3- 
azaphenylalanine, 4-azaphenylalanine, or 4-f!uorophenylalanine). See. 
Koide et al., Biochem . 33:7470-7476, 1994. Naturally occuning amino acid 
residues can be converted to non-naturally occurring species by in vitro 

25 chemical modification. Chemical modification can be combined with site- 
directed mutagenesis to further expand the range of substitutions (Wynn and 
Richards. Protein Sci. 2:395-403. 1993). 

Amino acid sequence changes are made in zsigSI polypeptides 
so as to minimize disruption of higher order stmcture essential to biological 

3 0 activity. Changes in amino acid residues will be made so as not to disrupt 
the cystine knot and "bow tie" arrangement of loops that is characteristic of 
the protein family. The effects of amino acid sequence changes can be 
predicted by computer modeling as disclosed above or determined by 
analysis of crystal structure (see, e.g., Lapthorn et al.. ibid.). A 

35 hydrophobicity profile of SEQ ID NO:2 is shown in the attached figure. Those 
skilled in the art will recognize that this hydrophobicity will be taken into 
account when designing alterations in the amino acid sequence of a zsigSI 
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polypeptide, so as not to disrupt the overall profit . Alignment of zsig51 with 
other family members also provides guidance in selecting amino acid 
substitutions, particularly if infonnation about the effects of amino acid 
substitutions in other family members is available. For example, alignment 
5 suggests that residue 95 (Ala) can be replaced witfi Ser. This variant 
sequence is shown in SEQ ID NO:29. Alignment with NDP, taking into 
account reported deleterious mutations, indicates that residues 8, 11, 12. 14, 
29. 30, 32, 34. 43. 44. 60, 63, 64, 65, 71. 74. 75. 80. 90. 91. 93. and 94 may 
be relatively intolerant of substitution or deletion. Residue 75 (Lys in SEQ ID 
10 N0:2) may be conservatively replaced with Arg. The region of zsig51 from 
the penultimate Cys residue to the carboxyl terminus (residues 98 to 106 of 
SEQ ID N0:2) may be important for receptor specificity. 

Essential amino acids in the polypeptides of \he present 
invention can be identified according to procedures known in ttie art. such as 

15 site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham 
and Wells, Science 244, 1081-1085. 1989; Bass et al.. Proc. Natl. Acad. Sci. 
USA 88:4498-4502. 1991), Multiple amino acid substitutions can be made 
and tested using known methods of mutagenesis and screening, such as 
those disclosed by Reidhaar-Olson and Sauer {Science 241 :53-57. 1988) or 

20 Bowie and Sauer (Prac. Natl. Acad Sci. USA 86:2152-2156. 1989). Other 
methods that can be used include phage display (e.g.. Lowman et al., 
Biochem. 30:10832-10837, 1991; Ladner et al., U.S. Patent No. 5.223,409; 
Huse, WlPO Publication WO 92/06204) and region-directed mutagenesis 
(Derbyshire et al.. Gene 46:145. 1986; Neret al., DA/A 7:127, 1988). 

25 Variants of the disclosed zsigSI DNA and polypeptide 

sequences can be generated through DNA shuffling as disclosed by 
Stemmer. Nature 370:389-391. 1994 and Stemmer, Proc. Natl. Acad Sci. 
USA 91:10747-10751. 1994. Briefly, variant genes are generated by in vitro 
homologous recombination by random fragmentation of a parent gene 

30 followed by reassembly using PGR, resulting in randomly introduced point 
mutations. 

Mutagenesis methods as disclosed above can be combined 
with high volume or high-throughput screening methods to detect biological 
activity of zsig51 variant polypeptides, in particular biological activity in 
35 modulating cell proliferation or cell differentiation. For example, mitogenesis 
assays that measure dye incorporation or ^H-thymidine incorporation can be 
earned out on large! numbers of samples, as can cell-based assays tiiat 
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detect expression of a reporter gene (e.g., a luciferase gene). These and 
other assays are disclosed in more detail below. Mutagenized DNA 
molecules that encode active zsig51 polypeptides can be recovered from the 
host cells and rapidly sequenced using modem equipment. These methods 
5 allow the rapid determination of the importance of individual amino acid 
residues in a polypeptide of interest, and can be applied to polypeptides of 
unknown structure. 

Using the methods discussed above, one of ordinary skill in the 
art can identify and/or prepare a variety of polypeptides that are substantially 

10 homologous to residues 1 through 106 of SEQ ID NO: 2 or allelic variants or 
orthologs thereof and retain the biological properties of the wild-type protein. 
Such polypeptides can also include additional polypeptide segments as 
generally disclosed above. 

The present invention also provides polynucleotide molecules, 

15 including DNA and RNA molecules, that encode the 2sig51 polypeptides 
disclosed above. Those skilled in the art will readily recognize that, in view of 
the degeneracy of the genetic code, considerable sequence variation is 
possible among these polynucleotide molecules. SEQ ID N0:4 is a 
degenerate DNA sequence that encompasses all DNAs that encode the 

20 zsig51 polypeptide of SEQ ID NO: 2. Those skilled in the art will recognize 
that the, degenerate sequence of SEQ ID N0:4 also provides all RNA 
sequences encoding SEQ ID N0:2 by substituting U for T. Thus, zsig51 
polypeptide-encoding polynucleotides comprising nucleotide 70 to nucleotide 
387 of SEQ ID NO: 4 and their RNA equivalents are contemplated by the 

2 5 present invention. A degenerate sequence encoding SEQ ID NO:29 is 
shown in SEQ ID NO:30. Table 2 sets forth the one-letter codes used within 
SEQ ID N0:4 and SEQ ID NO:30 to denote degenerate nucleotide positions. 
"Resolutions" are the nucleotides denoted by a code letter. "Complement" 
indicates the code for the complementary nucleotide(s). For example, the 

30 code Y denotes either C or T, and its complement R denotes A or G, A being 
complementary to T, and G being complementary to C. 
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TABLE 3 



Amino 


One-Letter 




Degenerate 


Acid 


Code 


Codons 


Codon 


Cys 


C 


TGC TGT 


TGY 


Ser 


S 


AGC AGT TCA TCC TCG TCT 


WSN 


Thr 


T 


M. A A ^^^^ M. M. ^^^^ 

ACA ACC ACG ACT 


CAN 


Pro 


P 


CCA CCC CCG CCT 


CCN 


Ala 


A 


GCA GCC GCG GCT 


GCN 


Gly 


G 


GGA GGC GGG GGT 


GGN 


Asn 


N 


AAC AAT 


AAV 


Asp 


D 


GAC GAT 


GAY 


Glu 


E 


GAAGAG 


GAR 


Gin 


Q, 


CAACAG 


CAR 


His 


H 


CAC CAT 


CAY 


Arg 


R 


AGA AGG CGA CGC CGG CGT 


MGN 


Lys 


K 


AAAAAG 


AAR 


Met 


M 


ATG 


ATG 


lie 


1 


ATA ATC ATT 


ATH 


Leu 


L 


CTA CTC CTG CTT TTA TTG 


YTN 


Val 


V 


GTA GTC GTG GTT 


GTN 


Phe 


F 


TTCTTT 


TTY 


Tyr 


Y 


TAC TAT 


TAY 


Trp 


W 


TGG 


TGG 


Ter 




TAA TAG TGA 


TRR 


AsnjAsp 


B 




RAY 


GiulGIn 


Z 




SAR 


Any 


X 




NNN 


Gap 









One of ordinary skill in the art will appreciate that some 
5 ambiguity is introduced in detemriining a degenerate codon, representative of 
all possible codons encoding each amino acid. For example, the degenerate 
codon for serine (WSN) can, in some circumstances, encode arginine (AGR). 
and the degenerate codon for arginine (MGN) can. in some circumstances, 
encode serine (AGY). A similar r lationship exists between codons encoding 
10 phenylalanine and leucine. Thus, some polynucleotides encompassed by 
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the degenerate sequences may encode variant amino acid sequences, but 
one of ordinary skill in the art can easily identify such variant sequences by 
reference to the amino acid sequence of SEQ ID NO: 2 or SEQ ID NO:29. 
Variant sequences can be readily tested for functionality as described herein. 
5 For any 2sig51 polypeptide, including variants and fusion 

proteins, one of ordinary skill in the art can readily generate a fully 
degenerate polynucleotide sequence encoding that variant using the 
information set forth in Tables 2 and 3 above. Moreover, those of skill in the 
art can use standard software to devise zsig51 variants based upon the 

10 nucleotide and amino acid sequences described herein. The present 
invention thus provides a computer-readable medium encoded with a data 
structure that provides at least one of the following sequences: SEQ ID N0:1, 
SEQ ID N0:2. SEQ ID N0:3, SEQ ID N0:4, SEQ ID NO:29, SEQ ID NO:30, 
SEQ ID N0:31, SEQ ID NO:32, SEQ ID NO:38, and SEQ ID NO:39. Suitable 

15 forms of computer-readable media include magnetic media and optically- 
readable media. Examples of magnetic media include a hard or fixed drive, a 
random access memory (RAM) chip, a floppy disk, digital.linear tape (DLT), a 
disk cache, and a ZIP disk. Optically readable media are exemplified by 
compact discs (e.g., CD-read only memory (ROM), CD-rewritable (RW), and 

20 CD-recordable), and digital versatile/video discs (DVD) (e.g.. DVD-ROM, 
DVD-RAM, and DVD+RW). 

Within preferred embodiments of the invention the isolated 
polynucleotides will hybridize to similar sized regions of SEQ ID N0:1, or a 
sequence complementary thereto, under stringent conditions. In general. 

25 stringent conditions are selected to be about 5^0 lower than the thermal 
melting point (T^) for the specific sequence at a defined ionic strength and 
pH. The Tm is the temperature (under defined ionic strength and pH) at 
which 50% of the target sequence hybridizes to a perfectly matched probe. 
Typical stringent conditions are those in which the salt concentration is up to 

3 0 about 0.03 M at pH 7 and the temperature is at least about 60^C. 

As previously noted, the isolated polynucleotides of the present 
invention include DNA and RNA. Methods for preparing DNA and RNA are 
well known in the art. Complementary DNA (cDNA) clones are prepared from 
RNA that is isolated from a tissue or cell that produces large amounts of 

35 zsig51 RNA. Such tissues and cells are identified by Northern blotting 
(Thomas, Proa NatL Acad ScL USA 77:5201, 1980). and include pancreas, 
pituitary, testis, and eye. Total RNA can be prepared using guanidine HCI 
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extraction followed by isolation by centrifugation in a CsCI gradi nt (Chirgwin 
et al.. Biochemistry 18:52-94, 1979). Poly (A)+ RNA is prepared from total 
RNA using the method of Aviv and Leder {Proc. Natl, Acad Sc/. USA 
69:1408-1412, 1972). Complementary DNA (cDNA) is prepared from 
5 poly(A)"^ RNA using known methods. In the alternative, genomic DNA can be 
isolated. For some applications (e.g., expression in transgenic animals) it 
may be preferable to use a genomic clone, or to modify a cDNA clone to 
include at least one genomic intron. Methods for identifying and isolating 
cDNA and genomic clones are well known and within the level of ordinary 
0 skill in the art, and include the use of the sequence disclosed herein, or parts 
thereof, for probing or priming a library. Polynucleotides encoding zsigSI 
polypeptides are identified and isolated by, for example, hybridization or 
polymerase chain reaction (TCR", Mullis, U.S. Patent 4,683.202). 
Expression libraries can be probed with antibodies to zsig51, receptor 
fragments, or other specific binding partners. 

Those skilled in the art will recognize that the sequences 
disclosed in SEQ ID N0S:1 and 2 represent a single allele of human zsigSI. 
Allelic variants of these sequences can be cloned by probing cDNA or 
genomic libraries from different individuals according to standard procedures. 

The present invention further provides counterpart 
polynucleotides and polypeptides from other species (orthologs). Of 
particular interest are zsig51 polynucleotides and polypeptides from other 
mammalian species, including murine, rat, porcine, ovine, bovine, canine, 
feline, equine and other primate proteins. Orthologs of the human 
polynucleotides can be cloned using infomnation and compositions provided 
by the present invention in combination with conventional cloning techniques. 
For example, a cDNA can be cloned using mRNA obtained from a tissue or 
cell type that expresses the protein. Suitable sources of mRNA can be 
identified by probing Northern blots with probes designed from the sequences 
disclosed herein. A library is then prepared from mRNA of a positive tissue 
or cell line. A zsigSI-encoding cDNA can then be isolated by a variety of 
methods, such as by probing with a complete or partial human cDNA or with 
one or more sets of degenerate probes based on the disclosed sequences. 
A cDNA can also be cloned by PGR using primers designed from the 
sequences disclosed herein. Within an additional method, the cDNA library 
can be used to transfonn or transfect host cells, and expression of the cDNA 
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2sig51 residues 63-68 

degenerate: WSN CAR TGY TGY ACN AT (SEQ ID NO: 17) 
consensus: WSN CAN TGY TGY MSN MY (SEQ ID NO: 18) 
complement: WSN GTN ACR ACR KSN KR (SEQ ID N0:19) 

2sig51 residues 11-16 

degenerate: CAY CCN TTY AAY GTN AC (SEQ ID NO:20) 
consensus: MRN CMN YWY WAY GTN RM (SEQ ID N0:21 ) 
complement: KYN GKN RWR WTR CAN YK (SEQ ID NO:22) 

ZsigSI polynucleotide sequences disclosed herein can also be 
used as probes or primers to clone 5' non-coding regions of a zsigSI gene. 
In view of the tissue-specific expression observed for zsig51 by Northern 
blotting, this gene region is expected to provide for pancreas-, testis-, eye-, 
and pituitary-specific expression. Promoter elements from a zsigSI gene 
could thus be used to direct the tissue-specific expression of heterologous 
genes in, for example, transgenic animals or patients treated with gene 
therapy. Cloning of 5' flanking sequences also feciiitates production of 
zsig51 proteins by "gene activation" as disclosed in U.S. Patent No. 
5,641,670. Briefly, expression of an endogenous zsigSI gene in a cell is 
altered by introducing into the zsigSI locus a DNA construct comprising at 
least a targeting sequence, a regulatory sequence, an exon, and an unpaired 
splice donor site. The targeting sequence is a zsigSI 5' non-coding 
sequence that permits homologous recombination of the construct with the 
endogenous zsigSI locus, whereby the sequences within the construct 
become operably linked with the endogenous zsigS! coding sequence. In 
this way. an endogenous zsigSI promoter can be replaced or supplemented 
with other regulatory sequences to provide enhanced, tissue-specific, or 
otherwise regulated expression. 

The polynucleotides of the present invention can also be 
prepared by automated synthesis. Synthesis of polynucleotides is within the 
level of ordinary skill in the art, and suitable equipment and reagents are 
available from commercial suppliers. See. in general. Click and Pasternak, 
Molecular Biotechnoloqv. Principles & Applications of Recombinant DNA , 
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interest, although certain signal sequences may be positioned elsewh re in 
the DNA sequence of interest (se . e.g., Welch et al., U.S. Patent No. 
5.037,743; Holland et al.. U.S. Patent No. 5,143,830). 

Expression of zsigSI polypeptides via a host cell secretory 
5 pathway is expected to result in the production of multimeric proteins. As 
noted above, such multimers include both homomultimers and 
heteromultimers, the latter including proteins comprising only zsig51 
polypeptides and proteins including 2sig51 and heterologous polypeptides. 
For example, a heteromultimer comprising a zsigSI polypeptide and a 
10 common alpha subunit can be produced by co-expression of the two 
polypeptides in a host cell. A cDNA sequence encoding a common alpha 
subunit is disclosed by Fiddes and Goodman, A/afure 281:351-356, 1979. A 
cDNA encoding the beta subunit of human chorionic gonadotropin is 
disclosed by Fiddes and Goodman, Nature 286:684-687. 1980. Berger et al., 
15 Nature Genetics 1:199-203, 1992 disclose cDNA clones encoding Nome 
disease protein. A TGF-p cDNA is disclosed by Derynck et al., Nature 
316:701-705, 1985. If a mixture of proteins results from expression, 
individual species are isolated by conventional methods. Monomers, dimers. 
and higher order multimers are separated by, for example, size exclusion 
20 chromatography. Heteromultimers can be separated from homomultimers by 
immunoaffinity chromatography using antibodies specific for individual dimers 
or by sequential immunoaffinity steps using antibodies specific for individual 
component polypeptides. See, in general. U.S. Patent No. 5,094,941. 
Multimers may also be assembled in vitro upon incubation of component 
25 polypeptides under suitable conditions. Recovery and assembly of proteins 
expressed in bacterial cells is disclosed below. 

Cultured mammalian cells are suitable hosts for use within the 
present invention. Methods for introducing exogenous DNA into mammalian 
host cells include calcium phosphate-mediated transfection (Wigler et aL, Cell 
30 14:725. 1978; Corsaro and Pearson, Somatic Cell Genetics 7:603. 1981: 
Graham and Van der Eb. VZ/o/ogy 52:456. 1973), electroporation (Neumann 
et al.. EMBO J. 1:841-845, 1982), DEAE-dextran mediated transfection 
(Ausubel et al.. ibid.), and liposome-mediated transfection (Hawley-Nelson et 
al.. Focus 15:73, 1993; Ciccarone et al.. Focas 15:80, 1993). The production 
35 of recombinant polypeptides in cultured mammalian cells is disclosed by, for 
example, Levinson et al., U.S. Patent No. 4,713,339; Hagen et al.. U.S. 
Patent No. 4.784,950; Palmiter et al.. U.S. Patent No. 4,579,821; and 
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Insect cells can be infected with recombinant baculovirus. 
commonly derived from Autographa califomica nuci ar polyhedrosis virus 
(AcNPV). See. King and Possee, The Baculovirus Expression Svstem: A 
Laboratory Guide . London. Chapman & Hall; O'Reilly et al., Baculovirus 
5 Expression Vectors: A Laboratory Manual . New York. Oxford University 
Press., 1994; and Richardson, Ed.. Baculovirus Expression Protocols. 
Methods in Molecular Biology, Humana Press, Totowa. NJ, 1995. 
Recombinant baculovirus can also be produced through the use of a 
transposon-based system described by Luckow et al. (J. ViroL 67:4566-4579. 

10 1993). This system, which utilizes transfer vectors, is commercially available 
in kit form (Bac-to-Bac^" kit; Life Technologies, Rockville, MD). The transfer 
vector (e.g., pFastBacI™; Life Technologies) contains a Tn7 transposon to 
move the DNA encoding the protein of interest into a baculovirus genome 
maintained in E. coli as a large plasmid called a "bacmid." See, Hill-Pericins 

15 and Possee, J. Gen, Virol. 71:971-976. 1990; Bonning et al., J. Gen, ViroL 
75:1551-1556. 1994; and Chazenbalk and Rapoport, J. Biol, Chem. 
270:1543-1549, 1995. In addition, transfer vectors can include an in-frame 
fusion with DNA encoding a polypeptide extension or affinity tag as disclosed 
above. Using techniques known in the art, a transfer vector containing a 

20 2sig51 -encoding sequence is transfomied into E. coli host cells, and the cells 
are screened for bacmids which contain an inten'upted lacZ gene indicative of 
recombinant baculovirus. The bacmid DNA containing the recombinant 
baculovirus genome is isolated, using common techniques, and used to 
transfect Spodoptera frugiperda cells, such as Sf9 cells. Recombinant vims 

25 that expresses zsig51 protein is subsequently produced. Recombinant viral 
stocks are made by methods commonly used the art. 

For protein production, the recombinant virus is used to infect 
host cells, typically a cell line derived from the fall armyworm, Spodoptera 
frugiperda (e.g.. Sf9 or Sf21 cells) or Trichoplusia ni (e.g., High Five^" cells; 

30 Invitrogen. Carlsbad. CA). See, in general, Glick and Pastemak, Molecular 
Biotechnology: Principles and Applications of Recombinant DNA . ASM 
Press. Washington, D.C., 1994. See also, U.S. Patent No. 5,300.435. 
Serum-free media are used to grow and maintain the cells. Suitable media 
formulations are known in the art and can be obtained from commercial 

35 suppliers. The cells are grown up from an inoculation density of 
approximately 2-5 x 10^ cells to a density of 1-2 x 10® cells, at which tim a 
recombinant viral stock is added at a multiplicity of infection (MOI) of 0.1 to 
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10. more typically near 3. Procedures used are generally described in 
available laboratory manuals (e.g., King and Possee, ibid.: O'R illy et al,, 
ibid: Richardson, ibid.). 

Fungal cells, including yeast cells, can also be used within the 
5 present invention. Yeast species of particular interest in this regard include 
Saccharomyces cerevisiae, Pichia pastoris, and Pichia methanolica. 
Methods for transforming S. cerevisiae cells with exogenous DNA and 
producing recombinant polypeptides therefrom are disclosed by. for example, 
Kawasaki, U.S. Patent No. 4,599,311; Kawasaki et a!,. U.S. Patent No. 

10 4,931,373; Brake. U.S. Patent No. 4,870,008; Welch et a!.. U.S. Patent No. 
5,037.743; and Murray et al., U.S. Patent No. 4,845,075. Transformation 
systems for other yeasts, including Hansenula polymorplia, 
Schizosaccliaromyces pombe, Kluyveromyces /acf/s, Kluyveromyces fragiiis, 
Ustilago maydis, Pichia pastoris, Pichia methanolica, Pichia guillermondii and 

15 Candida maltosa are known in the art. See, for example, Gleeson et al.. J. 
Gen. Microbiol. 132:3459-3465. 1986 and Cregg, U.S. Patent No. 4.882.279. 
The use of Pichia methanolica as host for the production of recombinant 
proteins is disclosed in U.S. Patents No. 5,716.808 and No. 5,736.383, and 
WlPO Publications WO 97/17450 and W097/17451. Aspergillus cells may 

20 be utilized according to the methods of McKnight et al.. U.S. Patent No. 
4,935,349. Methods for transforming Acremonium chrysogenum are 
disclosed by Sumino et al., U.S. Patent No. 5,162,228. Methods for 
transfomning Neurospora are disclosed by Lambowitz, U.S. Patent No. 
4,486.533. 

25 Prokaryotic host cells, including strains of the bacteria 

Escherichia coli. Bacillus and other genera are also useful host cells within 
the present invention. Techniques for transforming these hosts and 
expressing foreign DNA sequences cloned therein are well known in the art 
(see, e.g., Sambrook et al.. Ibid.). 

30 Transformed or transfected host cells are cultured according to 

conventional procedures in a culture medium containing nutrients and other 
components required for the growth of the chosen host cells. A variety of 
suitable media, including defined media and complex media, are known in 
the art and generally include a carbon source, a nitrogen source, essential 

3 5 amino acids, vitamins and minerals. Media may also contain such 
components as growth factors or serum, as required. The growth medium 
will generally select for cells containing the exogenously added DNA by, for 
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Rapp, Chem, Pept Prot 3:3, 1986; and Atherton et al., Solid Phas Peotid 
Synthesis: A Practical App roach |RL Press. Oxford, 1989. 

Using methods known in the art, zsigSI proteins can be 
prepared as monomers or multimers; glycosylated or non-glycosylated; 
5 pegylated or non-pegylated; and may or may not include an initial methionine 
amino acid residue. 

Activity of zsig51 polypeptides and proteins can be measured in 
vitro using cultured cells or in vivo by administering molecules of the claimed 
invention to the appropriate animal model. For example, mitogenic activity 
0 can be measured using known assays, including ^H-thymidine incorporation 
assays (as disclosed by. e.g., Raines and Ross, Methods EnzymoL 109:749- 
773, 1985), dye incorporation assays (as disclosed by, for example, Mosman. 
J. Immunol. Meth. 65:55-63, 1983 and Raz et al.. Acta Trop. 68:139-147, 
1997) or cell counts. A preferred mitogenesis assay measures the 
5 incorporation of the dye Alamar blue (Raz et al., ibid.) into pancreatic or 
hypothalamic cells. See also, Gospodarowicz et al., J. CelL Biol. 70:395-405, 
1976; Ewton and Florini, Endocrinol. 106:577-583. 1980; and Gospodarowicz 
et al., P/Dc. Natl. Acad. Set. USA 86:7311-7315. 1989. Differentiation can be 
assayed using suitable precursor cells that contain a differentiation-specific 
reporter element For example, pancreatic precursor cells can be used to 
measure the ability of a zsig51 protein to stimulate differentiation into a 
specific pancreatic cell type (e.g.. p-islet cells). An islet cell-specific 
promoter, such as an insulin gene promoter, can be linked to a reporter gene, 
such as a luciferase gene, whereby luciferase is expressed in the 
differentiated islet cells but not in the precursors. Precursor cells containing 
the reporter element can be obtained, for example, from mice made 
transgenic for the reporter element. A similar strategy can be applied to 
assay the effect of zsig51 protein on hypothalamic precursor cells and other 
cell types. 

Zsig51 polypeptides and proteins can be assayed in vivo 
through the use of viral delivery systems. Exemplary vimses for this purpose 
include adenovirus, herpesvirus, vaccinia virus and adeno-associated virus 
(AAV). Adenovirus, a double-stranded DNA virus, is cun-ently the best 
studied gene transfer vector for delivery of heterologous nucleic acid 
(reviewed by Becker et al.. Meth. Cell BioL 43:161-89, 1994; and Douglas 
and Curiel, Science & Medicine 4:44-53, 1997). The adenovirus system 
offers several advantages: adenovirus can (i) accommodate relatively large 
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DNA inserts; (ii) be grown to high-titer; (iii) infect a broad range of mammalian 
cell types; and (iv) be used with a large number of available vectors 
containing different promoters. Also, because adenoviruses are stable in the 
bloodstream, they can be administered by intravenous injection. 
5 By deleting portions of the adenovirus genome, larger inserts 

(up to 7 kb) of heterologous DNA can be accommodated. These inserts can 
be incorporated into the viral DNA by direct ligation or by homologous 
recombination with a co-transfected plasmid. In an exemplary system, the 
essential E1 gene is deleted from the viral vector, and the virus does not 

10 replicate unless the E1 gene is provided by the host cell (e.g., the human 293 
cell line). When intravenously administered to intact animals, adenovirus 
primarily targets the liver. If the adenoviral delivery system has an E1 gene 
deletion, the virus cannot replicate in the host cells. However, the host's 
tissue (e.g., liver) will express and process (and, if a signal sequence is 

15 present, secrete) the heterologous protein. Secreted proteins will enter the 
circulation in the highly vascularized liver, and effects on the infected animal 
can be detemiined. 

Mice engineered to express the zsig51 gene, referred to as 
"transgenic mice", and mice that exhibit a complete absence of zsig51 gene 

20 function, referred to as "knockout mice", can also be generated (Snouwaert 
et al., Science 257:1083, 1992; Lowell et al., Nature 366:740-742, 1993; 
Capecchi, Science 244:1288-1292. 1989; Palmiter et al. Anna, Rev, Genet 
20:465-499. 1986). For example, transgenic mice that over-express zsig51, 
either ubiquitously or under a tissue-specific or tissue-restricted promoter. 

25 can be used to determine if over-expression causes a phenotype. For 
example, over-expression of a wild-type zsig51 polypeptide, polypeptide 
fragment, or a mutant thereof may alter normal cellular processes, resulting 
in a phenotype that identifies a tissue in which zsigSI expression is 
functionally relevant and may indicate a therapeutic target for the zsig51, its 

3 0 agonists or antagonists. Moreover, such over-expression may result in a 
phenotype that shows similarity with human diseases. Similarly, knockout 
zsigSI mice can be used to determine where zsig51 is absolutely required in 
vivo. The phenotype of knockout mice may be predictive of the in vivo effects 
of a zsig51 antagonist such as those described herein. The murine zsig51 

35 cDNA can be used to isolate murine genomic DNA. which is subsequently 
used to generate knockout mice. These mice can be employed to study the 
zsigSI gene and the protein encoded thereby in an in vivo system, and can 
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If a mammal has an insufficiency of zsig51 polyp ptide (due to, 
for example, a mutated or absent zsigSI gene)» the 2sig51 g ne can be 
introduced into the cells of the mammal. In one embodiment, a gene 
encoding a zsig51 polypeptide is introduced in vivo in a viral vector. Such 
5 vectors include an attenuated or defective DNA virus, such as, but not limited 
to. herpes simplex virus (HSV). papillomavims, Epstein Barr virus (EBV)» 
adenovims, adeno-associated virus (AAV), and the like. Defective viaises, 
which entirely or almost entirely lack viral genes, are prefen^ed. A defective 
virus is not infective after introduction into a cell. Use of defective viral 

10 vectors allows for administration to cells in a specific, localized area, without 
concern that the vector can infect other cells. Examples of particular vectors 
include, but are not limited to, a defective herpes simplex virus 1 (HSV1) 
vector (Kaplitt et al.. Molec. Cell. Neurosci. 2:320-30, 1991); an attenuated 
adenovirus vector, such as the vector described by Stratford-Pemcaudet et 

15 al. (J. Clin. Invest 90:626-30, 1992); and a defective adeno-associated virus 
vector (Samulski et aL, J. Virol, 61:3096-101, 1987; Samulski et al., J. ViroL 
63:3822-28.1989). 

Within another embodiment, the zsigSI gene can be introduced 
in a retroviral vector, e.g., as described in Anderson et al.. U.S. Patent No. 

20 5,399,346; Mann et al., Ce// 33:153. 1983; Temin et al., U.S. Patent No. 
4,650.764; Temin et al., U.S. Patent No. 4,980,289; Markowitz et al.. J. ViroL 
62:1120, 1988; Temin et al., U.S. Patent No. 5,124,263; International Patent 
Publication No. WO 95/07358, published March 16. 1995 by Dougherty et al.; 
and Kuo et al., B/ood 82:845-852. 1993. In the alternative, the vector can be 

25 introduced by lipofection in vivo using liposomes. Synthetic cationic lipids 
can be used to prepare liposomes for in vivo transfection of a gene encoding 
a marker (Feigner et al., Proc. NatL Acad, Sci USA 84:7413-17, 1987; 
Mackey et al.. Proa, NatL Acad ScL USA 85:8027-31, 1988). The use of 
lipofection to introduce exogenous genes into specific organs in vivo has 

3 0 certain practical advantages, including the ability to direct transfection to 
particular cells. Directing transfection to particular cell types is particularly 
advantageous in a tissue with cellular heterogeneity, such as the pancreas, 
liver, kidney, or brain. Lipids can be chemically coupled to other molecules 
for the purpose of targeting. Targeted peptides (e.g.. hormones or 

3 5 neurotransmitters), proteins such as antibodies, or non-peptide molecules 
can be coupled to liposomes chemically. 
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Within another embodiment, target cells are removed from the 
body, heterologous DNA is introduced as a naked DNA plasmid, and the cells 
are re-implanted in the body. Naked DNA vectors for gene therapy can be 
introduced into the desired host cells by methods known in the art, e.g., 
5 transfection. electroporation, microinjection, transduction, cell fusion. DEAE 
dextran. calcium phosphate precipitation, use of a gene gun or use of a DNA 
vector transporter. See. e.g.. Wu et al.. J. BioL Chem. 267:963-67. 1992; Wu 
et al.. J. BioL Chem, 263:14621-24. 1988. 

The present invention further provides antisense 

10 polynucleotides that are complementary to a segment of the polynucleotides 
set forth in SEQ ID NO: 1 Such synthetic antisense oligonucleotides are 
designed to bind to mRNA encoding zsigSI polypeptides and to inhibit 
translation of such mRNA. Such antisense oligonucleotides are used to 
inhibit expression of zsigSI polypeptide-encoding genes in cell culture or in a 

15 subject. Antisense approaches may be applied in the treatment of conditions 
such as cancers or hyperplasias of the pancreas or pituitary, pituitary 
homnone hypersecretion, prolactin hypersecretion, . growth hormone 
hypersecretion (acromegaly), and ACTH hypersecretion (Cushing's disease). 
The present invention also provides reagents for use in 

20 diagnostic applications. The human zsigSI gene has been localized to 
chromosome band 11q13. Several other genes of interest have been 
localized to this region of chromosome 11, including tumor suppressor gene 
MEN1 (multiple endocrine neoplasia type 1 , an autosomal dominant familial 
cancer syndrome characterized by tumors in enteropancreatic endocrine 

25 tissues, anterior pituitary, and parathyroid, and by peptic ulcers), which has 
been positionally cloned (Chandrasekharappa et al.. Science 276:404-407. 

1997) . and a second, as yet uncloned, gene also associated with MEA/Mike 
symptoms (Chakrabarti et al., Genes Chromosomes Cancer 22:130-137. 

1998) . ZsigSI is a candidate for this as yet unidentified disease gene, as 
30 well as for IDDM4, an insulin-dependent diabetes mellitus susceptibility locus 

on 11q13, and for Bardet-Biedl syndrome type 1. Thus, the zsigSI gene, a 
probe comprising zsigSI DNA or RNA, or a subsequence thereof can be 
used to determine if the zsigSI gene is present on chromosome 11 or if a 
mutation has occurred. Detectable chromosomal aberations at the zsigSI 
3 5 gene locus include, but are not limited to, aneuploidy, gene copy number 
changes, insertions, deletions, restriction site changes and rearrangements. 
These aberrations can occur within the coding sequence, within introns. or 
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within flanking sequences, including upstream promoter and regulatory 
regions, and may be manif sted as physical alterations within a coding 
sequence or changes in gene expression level. Analytical probes will 
generally be at least 20 nucleotides in length, although somewhat shorter 
5 probes (14-17 nucleotides) can be used. PGR primers are at least 5 
nucleotides in length, preferably 15 or more nt, more preferably 20-30 nt 
Short polynucleotides can be used when a small region of the gene is 
targetted for analysis. For gross analysis of genes, a polynucleotide probe 
may comprise an entire exon or more. Probes will generally comprise a 

10 polynucleotide linked to a signal-generating moiety such as a radionudeotide. 
In general, these diagnostic methods comprise the steps of (a) obtaining a 
genetic sample from a patient; (b) incubating the genetic sample with a 
polynucleotide probe or primer as disclosed above, under conditions wherein 
the polynucleotide will hybridize to complementary polynucleotide sequence, 

15 to produce a first reaction product; and (c) comparing the first reaction 
product to a control reaction product. A difference between the first reaction 
product and the control reaction product is indicative of a^genetic abnormality 
in the patient. Genetic samples for use within the present invention include 
genomic DNA, cDNA, and RNA, The polynucleotide probe or primer can be 

20 RNA or DNA, and will comprise a portion of SEQ ID N0:1, the complement of 
SEQ ID N0:1, or an RNA equivalent thereof. Suitable assay methods in this 
regard include molecular genetic techniques known to those in the art. such 
as restriction fragment length polymorphism (RFLP) analysis, short tandem 
repeat (STR) analysis employing PGR techniques, ligation chain reaction 

25 (Barany, PCR Methods and Applications 1:5-16, 1991), ribonuclease 
protection assays, and other genetic linkage analysis techniques known in 
the art (Sambrook et al., ibid.] Ausubel et. al., ibid,] A,J. Marian. Cliest 
108:255-65, 1995). Ribonuclease protection assays (see, e.g., Ausubel et 
al., ibid., ch. 4) comprise the hybridization of an RNA probe to a patient RNA 

30 sample, after which the reaction product (RNA-RNA hybrid) is exposed to 
RNase. Hybridized regions of the RNA are protected from digestion. Within 
PCR assays, a patient genetic sample is incubated with a pair of 
oligonucleotide primers, and the region between the primers is amplified and 
recovered. Changes in size, amount, or sequence of recovered product are 

35 indicative of mutations in th patient. Another PCR-based technique that can 
be mployed is single strand conformational polymorphism (SSCP) analysis 
(Hayashi, PCR Methods and Applications 1991). 
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antibody can bind. See, for example, Geysen et aL, Proc, Natl. Acad ScL 
USA 81:3998-4002, 1984. Epitopes can be linear or confonmational, the 
latter being composed of discontinuous regions of the protein that fonm an 
epitope upon folding of the protein. Linear epitopes are generally at least 6 
5 amino acid residues in length. Relatively short synthetic peptides that mimic 
part of a protein sequence are routinely capable of eliciting an antiserum that 
reacts with the partially mimicked protein. See, Sutclrffe et a!., Science 
219:660-666, 1983. Antibodies that recognize short, linear epitopes are 
particularly useful in analytic and diagnostic applications that employ 

10 denatured protein, such as Western blotting (Tobin, Proc. Natl. Acad Sc/. 
USA 76:4350-4356, 1979). Antibodies to short peptides may also recognize 
proteins in native conformation and will thus be useful for monitoring protein 
expression and protein isolation, and in detecting zsigSI proteins in solution, 
such as by ELISA or in immunoprecipitation studies. 

15 Antigenic, epitope-bearing polypeptides of the present invention 

are useful for raising antibodies, including monoclonal antibodies, that 
specifically bind to a zsigSI protein. Antigenic, epitope-bearing polypeptides 
contain a sequence of at least six, preferably at least nine, more preferably 
from 15 to about 30 contiguous amino acid residues of a zsig51 protein (e.g., 

20 SEQ ID N0:2). Polypeptides comprising a larger portion of a zsigSI protein, 
i.e. from 30 to 50 residues up to the entire sequence are included. It is 
preferred that the amino acid sequence of the epitope-bearing polypeptide is 
selected to provide substantial solubility in aqueous solvents, that is the 
sequence includes relatively hydrophilic residues, and hydrophobic residues 

25 are substantially avoided. 

As used herein, the term "antibodies" includes polyclonal 
antibodies, monoclonal antibodies, antigen-binding fragments thereof such 
as F(ab')2 and Fab fragments, single chain antibodies, and the like, including 
genetically engineered antibodies. Non-human antibodies can be humanized 

30 by grafting only non-human CDRs onto human framework and constant 
regions, or by incorporating the entire non-human variable domains 
(optionally "cloaking" them with a human-like surface by replacement of 
exposed residues, wherein the result is a "veneered" antibody). In some 
instances, humanized antibodies may retain non-human residues within the 

35 human variable region framework domains to enhance proper binding 
characteristics. Through humanizing antibodies, biological half-life may be 
increased, and the potential for adverse immune reactions upon 
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administration to humans is reduced. One skilled in the art can generate 
humanized antibodies with specific and different constant domains (i.e., 
different Ig subclasses) to facilitate or inhibit various immune functions 
associated with particular antibody constant domains. 
5 Alternative techniques for generating or selecting antibodies 

useful herein include in vitro exposure of lymphocytes to zsig51 polypeptide, 
and selection of antibody display libraries in phage or similar vectors (for 
instance, through use of immobilized or labeled zsig51 polypeptide). Human 
antibodies can be produced in transgenic, non-human animals that have 
10 been engineered to contain human immunoglobulin genes as disclosed in 
WlPO Publication WO 98/24893. It is preferred that the endogenous 
immunoglobulin genes in these animals be inactivated or eliminated, such as 
by homologous recombination. 

Antibodies are defined to be specifically binding if they bind to a 
zsig51 polypeptide with an affinity at least 10-fold greater than the binding 
affinity to control (non-zsigSI) polypeptide. It is prefen-ed that the antibodies 
exhibit a binding affinity (Kg) of 10^ M'^ or greater, preferably 10^ M'^ or 
greater, more preferably 10® M"^ or greater, and most preferably 10^ M"^ or 
greater. The affinity of a monoclonal antibody can be readily determined by 
one of ordinary skill in the art (see. for example, Scatchard, Ann. NY Acad, 
Sc/. 51:660-672. 1949). 

Methods for preparing polyclonal and monoclonal antibodies 
are well known in the art (see for example, Hurrell, J. G. R., Ed., Monoclonal 
Hybridoma Antibodies: Techniques and Applications, CRC Press. Inc., Boca 
Raton, FL, 1982). As would be evident to one of ordinary skill in the art. 
polyclonal antibodies can be generated from a variety of warm-blooded 
animals such as horses, cows, goats, sheep, dogs, chickens, rabbits, mice, 
and rats. The immunogenicity of a zsigSI polypeptide may be increased 
through the use of an adjuvant such as alum (aluminum hydroxide) or 
Freund's complete or incomplete adjuvant. Polypeptides useful for 
immunization also include fusion polypeptides, such as fusions of a zsig51 
polypeptide or a portion thereof with an immunoglobulin polypeptide or with 
maltose binding protein. The polypeptide immunogen may be a full-length 
molecule or a portion thereof. If the polypeptide portion is "hapten-like", such 
portion may be advantageously joined or linked to a macromolecular carrier 
(such as keyhole limpet hemocyanin (KLH), bovine serum albumin (BSA) or 
tetanus toxoid) for immunization. 
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A variety of assays known to those skilled in the art can be 
utilized to detect antibodies that specifically bind to zsigSI polypeptides. 
Exemplary assays are described in detail in Antibodies: A Laboratory Manual, 
Harlow and Lane (Eds.). Cold Spring Harbor Laboratory Press. 1988. 
5 Representative examples of such assays include concurrent 
immunoelectrophoresis, radio-immunoassays, radio-immunoprecipitations. 
enzyme-linked immunosorbent assays (ELISA). dot blot assays, Western blot 
assays, inhibition or competition assays, and sandwich assays. 

Antibodies to zsigSI can be used, for example, to isolate zsig51 

10 polypeptides by affinity purification; for diagnostic assays for detemiining 
circulating or localized levels of zsig51 polypeptides; for screening expression 
libraries; for generating anti-idiotypic antibodies; and as neutralizing 
antibodies or as antagonists to block zsig51 activity in vitro and in vivo. 

Antibodies and polypeptides disclosed herein can also be 

15 directly or indirectly conjugated to drugs, toxins, radionuclides, and the like, 
and these conjugates used for in vivo diagnostic or therapeutic applications. 
For instance, polypeptides or antibodies of the present invention may used to 
identify or treat tissues or organs that express a corresponding anti- 
complementary molecule (receptor or antigen, respectively, for instance). 

20 More specifically, zsig51 polypeptides or anti-zsig51 antibodies, or bioactive 
fragments or portions thereof, can be coupled to detectable or cytotoxic 
molecules and delivered to a mammal having cells, tissues or organs that 
express the anti-complementary molecule. Suitable detectable molecules 
may be directly or indirectly attached to the polypeptide or antibody, and 

25 include radionuclides, enzymes, substrates, cofactors, inhibitors, fluorescent 
markers, chemiluminescent markers, magnetic particles, and the like. 
Cytotoxic molecules can be directly or indirectly attached to the polypeptide 
or antibody, and include bacterial or plant toxins (for instance, diphtheria 
toxin. Pseudomonas exotoxin, ricin, saporin, abrin, and the like), as well as 

30 therapeutic radionuclides, such as iodine-131, rhenium-188 or yttrium-90 
(either directly attached to the polypeptide or antibody, or indirectly attached 
through means of a chelating moiety, for instance). Polypeptides or 
antibodies can also be conjugated to cytotoxic drugs, such as adriamycin. 
For indirect attachment of a detectable or cytotoxic molecule, the detectable 

35 or cytotoxic molecule may be conjugated with a member of a 
complementary/anticomplementary pair, where the other member is bound to 
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the polypeptide or antibody portion. For these purposes, biotin/streptavidin is 
an exemplary complem ntary/anticomplementary pair. 

In another embodiment, polypeptide-toxin fusion proteins or 
antibody-toxin or fragment-toxin fusion proteins may be used for targeted cell 
5 or tissue inhibition or ablation (for instance, to treat cancer cells or tissues). 
Target cells (i.e., those displaying the 2sig51 receptor) bind the zsig51-toxin 
conjugate, which is then internalized, killing the cell. The effects of receptor- 
specific cell killing (target ablation) are revealed by changes in whole animal 
physiology or through histological examination. Thus, ligand-dependent, 
0 receptor-directed cyotoxicity can be used to enhance understanding of the 
physiological significance of a protein ligand. A prefenred such toxin is 
saporin. Mammalian cells have no receptor for saporin, which is non-toxic 
when it remains extracellular Alternatively, if the polypeptide has multiple 
functional domains (i.e., an activation domain or a ligand binding domain, 
plus a targeting domain), a fusion protein including only the targeting domain 
may be suitable for directing a detectable molecule, a cytotoxic molecule or a 
complementary molecule to a cell or tissue type of interest. In instances 
where the domain-only fusion protein includes a complementary molecule, 
the anti-complementary molecule can be conjugated to a detectable or 
0 cytotoxic molecule. Such domain-complementary molecule fusion proteins 
thus represent a generic targeting vehicle for cell- or tissue-specific delivery 
of generic anti-complementary-detectable/ cytotoxic molecule conjugates. 

In another embodiment, polypeptide-cytokine fusion proteins or 
antibody-cytokine fusion proteins may be used for enhancing in vitro 
cytotoxicity (for instance, that mediated by monoclonal antibodies against 
tumor targets) and for enhancing in vivo killing of target tissues (for example, 
blood and bone marrow cancers). See, generally, Hornick et al., Blood 
89:4437-4447, 1997). In general, cytokines are toxic if administered 
systemically. The described fusion proteins enable targeting of a cytokine to 
a desired site of action, thereby providing an elevated local concentration of 
cytokine. Suitable zsigSI polypeptides or anti-zsig51 antibodies target an 
undesirable cell or tissue (e.g., a tumor), and the fused cytokine mediates 
improved target cell lysis by effector cells. Suitable cytokines for this purpose 
include, for example, interleukin-2 and granulocyte-macrophage colony- 
stimulating factor (GM-CSF). 
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The bioactive polypeptide and antibody conjugates described 
herein can be delivered intravenously, intraarterially or intraductally. or nnay 
be introduced locally at the intended site of action. 

Inhibitors of zsig51 activity (2sig51 antagonists) include anti- 
5 2sig51 antibodies and soluble 2sig51 receptors, as well as other peptidic and 
non-peptidic agents (including ribozymes). Such antagonists can be used to 
block the effects of zsig51 in vitro and in vivo. Of particular interest is the use 
of antagonists of zsig51 activity in cancer therapy. As early detection 
methods improve it becomes possible to intervene at earlier times in tumor 

10 development making it feasible to use inhibitors of angiogenesis to block the 
angiogenic switch that precedes the progression to invasive cancer. 
Inhibitors of zsigSI activity can be used in combination with other cancer 
therapeutic agents. 

For pharmaceutical use, the proteins of the present invention 

15 are formulated for parenteral, particularly intravenous or subcutaneous, 
delivery according to conventional methods. Intravenous administration will 
be by bolus injection or infusion over a typical period of one to several hours. 
In general, phamiaceutical fomiulations will include a zsig51 protein in 
combination with a phamnaceutically acceptable vehicle, such as saline. 

20 buffered saline, 5% dextrose in water or the like. Formulations may further 
include one or more excipients, presen/atives, solubilizers. buffering agents, 
albumin to prevent protein loss on vial surfaces, etc. Methods of formulation 
are well known in the art and are disclosed, for example, in Remington: The 
Science and Practice of Pharmacy, Gennaro, ed., Mack Publishing Co., 

25 Easton, PA, 19th ed.. 1995. Therapeutic doses will generally be in the range 
of 0.1 to 100 jig/kg of patient weight per day, preferably 0.5-20 |ig/kg per day, 
with the exact dose detemnined by the clinician according to accepted 
standards, taking into account the nature and severity of the condition to be 
treated, patient traits, etc. Doses of zsigSI protein will generally be 

30 administered on a daily to weekly schedule, with individual doses typically 
within the range of 0.1-10 mg/patient. Determination of dose is within the 
level of ordinary skill in the art. The proteins may be administered for acute 
treatment, over one week or less, often over a period of one to three days or 
may be used in chronic treatment, over several months or years. In general, 

3 5 a therapeutically effective amount of zsigSI is an amount sufficient to 
produce a clinically significant change in the targetted condition. 
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The invention is further illustrated by the following non-limfting 

examples. 

Example 1: Production a Pancreatic Islet Cell cDNA Library 
5 RNA extracted from pancreatic islet cells was reverse 

transcribed in the following manner. The first strand cDNA reaction contained 
10 jil of human pancreatic islet cell poly d(T)-selected poly (A)"*" mRNA 
(Clontech Laboratories. Inc., Palo Alto, CA) at a concentration of 1.0 mg/ml, 
and 2 ^l of 20 pmole/|il first strand primer ZC6171 (SEQ ID NO:23) 

10 containing an Xho I restriction site. The mixture was heated at 70°C for 2.5 
minutes and cooled by chilling on ice. First strand cDNA synthesis was 
initiated by the addition of 8 ^1 of first strand buffer (5x SUPERSCRIPT^" 
buffer; Life Technologies, Gaithersburg. MD), 4 ^1 of 100 mM dithiothreitol, 
and 3 [xl of a deoxynucleotide triphosphate (dNTP) solution containing 10 mM 

15 each of dTTP, dATP, dGTP and 5-methyl-dCTP (Pharmacia LKB 
Biotechnology. Piscataway, NJ) to the RNA-primer mixture. The reaction 
mixture was incubated at 40** C for 2 minutes, followed. by. the addition of 10 
^il of 200 Will RNase reverse transcriptase (SUPERSCRIPT II®; Life 
Technologies). The efficiency of the first strand synthesis was analyzed in a 

20 parallel reaction by the addition of 10 ^Ci of 32p-(xdCTP to a 5 jxl aliquot from 
one of the reaction mixtures to label the reaction for analysis. The reactions 
were incubated at 40*'C for 5 minutes, 45'*C for 50 minutes, then incubated at 
50°C for 10 minutes. Unincorporated 32p_ctdCTP in the labeled reaction was 
removed by chromatography on a 400 pore size gel filtration column 

25 (Clontech Laboratories, Inc.). The unincorporated nucleotides and primers in 
the unlabeled first strand reactions were removed by chromatography on 400 
pore size gel filtration column. The length of labeled first strand cDNA was 
detemiined by agarose gel electrophoresis. 

The second strand reaction contained 102 |il of the unlabeled 

30 first strand cDNA. 30 ^il of 5x polymerase I buffer {125 mM Tris-HCI, pH 7.5, 
500 mM KCI, 25 mM MgCl2, 50mM (NH4)2S04)), 2.0 ^il of 100 mM 
dithiothreitol, 3.0 jil of a solution containing 10 mM of each deoxynucleotide 
triphosphate, 7 |al of 5 mM p-NAD, 2.0 pi of 10 U/^il £. coli DNA ligase (New 
England Biolabs), 5 ^1 of 10 U/^il E. coli DNA polymerase I (New England 

35 Biolabs, Beverly, MA), and 1.5 iil of 2 U/^l RNase H (Life Technologies), A 
10 ^l aliquot from one of the second strand synthesis reactions was labeled 
by the addition of 10 jiCi 32p^dCTP to monitor the efficiency of second 
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strand synthesis. The reactions wer incubated at 16** C for two hours, 
followed by the addition of 1 jil of a 10 mM dNTP solution and 6.0 ^1 T4 DNA 
polymerase (10 U/^l, Boehringer Mannheim. Indianapolis. IN) and incubated 
for an additional 10 minutes at 16**C. Unincorporated 32p^dCTP in the 
5 reaction mixture was removed by chromatography through a 400 pore size 
gel filtration column before analysis by agarose gel electrophoresis. The 
reaction was temiinated by the addition of 10.0 ^1 0.5 M EDTA and extraction 
with phenol/chloroform and chlorofonm followed by ethanol precipitation in the 
presence of 3.0 M Na acetate and 2 ^il of Pellet Paint carrier (Novagen, 
10 Madison, Wl). The yield of cDNA was estimated to be approximately 2 ^ig 
from starting mRNA template of 10 ^g. 

Eco Rl adapters were ligated onto the 5' ends of the cDNA 
described above to enable cloning into an expression vector. A 12.5 |il 
aliquot of cDNA (-2.0 ^ig) and 3 ^l of 69 pmole/ul of Eco Rl adapter 
15 (Pharmacia LKB Biotechnology Inc.. Piscataway, NJ) were mixed with 2.5 ^il 
10x ligase buffer (660 mM Tris-HCI pH 7.5. 100 mM MgCl2), 2.5 ^il of 10 mM 
ATP. 3.5 ^il 0.1 M DTT and 1 ^tl of 15 U/jil T4 DNA ligase (Promega Corp., 
Madison. Wl). The reaction was incubated 1 hour at 5*'C, 2 hours at 7.5X; 2 
hours at 10X. 2 hours at 12,5'C and 16 hours at 10** C. The reaction was 
20 terminated by the addition of 65 ^1 HjO and 10 ^l 10X H buffer (Boehringer 
Mannheim, Indianapolis, IN) and incubation at 70°C for 20 minutes. 

To facilitate the directional cloning of the cDNA into an 
expression vector, the cDNA was digested with Xho I, resulting in a cDNA 
having a 5' Eco Rl cohesive end and a 3' Xho I cohesive end. The Xho I 
25 restriction site at the 3' end of the cDNA had been previously introduced. 
Restriction enzyme digestion was carried out in a reaction mixture by the 
addition of 1.0 ^1 of 40 U/|il Xho I (Boehringer Mannheim). Digestion was 
carried out at 37''C for 45 minutes. The reaction was terminated by 
incubation at 70*^0 for 20 minutes and chromatography through a 400 pore 
3 0 size gel filtration column. 

The cDNA was ethanol precipitated, washed with 70% ethanol. 
air dried and resuspended in 10.0 ^il water. 2 of 10X kinase buffer (660 mM 
Tris-HCI. pH 7.5. 100 mM MgCy, 0.5 \xl 0.1 M DTT, 2 ^1 10 mM ATP. 2 ^1 T4 
polynucleotide kinase (10 U/|il. Life Technologies). Following incubation at 
3 5 37^0 for 30 minutes, the cDNA was ethanol precipitated in the presence of 
2.5 M ammonium acetate, and electrophoresed on a 0.8% low melt agarose 
gel. The contaminating adapters and cDNA below 0.6 kb in length were 
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excised from the gel. The electrodes were reversed, and the cDNA was 
electrophoresed until concentrated near the Ian origin. The area of the gel 
containing the concentrated cDNA was excised and placed in a microfuge 
tube, and the approximate volume of the gel slice was detemiined. An 
5 aliquot of water approximately three times the volume of the gel slice (300 jil) 
and 35 ^il 10x p-agarose I buffer (New England Biolabs) were added to the 
tube, and the agarose was melted by heating to 65''C for 15 minutes. 
Following equilibration of the sample to 45°C. 3 >il of 1 U/^l p-agarase I (New 
England Biolabs, Beverly, MA) was added, and the mixture was incubated for 

id 60 minutes at 45»C to digest the agarose. After incubation, 40 nl of 3 M Na 
acetate was added to the sample, and the mixture was incubated on ice for 
15 minutes. The sample was centrifuged at 14,000 x g for 15 minutes at 
room temperature to remove undigested agarose. The cDNA was ethanol 
precipitated, washed in 70% ethanol, air-dried and resuspended in 40 ^1 

15 water. 

Following recovery from low-melt agarose gel, the cDNA was 
cloned into the Eco Rl and Xho I sites of a phagemid. vector (pBluescript II 
SK*; Sfratagene, La Jolla, CA) and electroporated into DH10B cells. Bacterial 
colonies containing ESTs of known genes were identified and eliminated from 

20 sequence analysis by reiterative cycles of probe hybridization to hi-density 
colony filter an-ays (Genome Systems, St. Louis, Ml). cDNAs of known 
genes were pooled in groups of 50 - 100 inserts and were labeled with "P- 
adCTP using a MEGAPRIME labeling kit (Amersham, Arlington Heights. IL). 
Colonies which did not hybridize to the probe mixture were selected for 

25 sequencing. Sequencing was done using automated equipment. The 
resulting data were analyzed, resulting in the identification of a novel EST 
designated SISF1 000391. The sequence of this EST is shown in SEQ ID 
NO:24. The plasmid containing this insert was designated pSLSIG51-1. The 
insert was sequenced and found to comprise the sequence shown in SEQ ID 

3 0 N0:1. 

Subsequent to the cloning of pSLSIG51-1, a zsigSI clone was 
identified in a pituitary cDNA library. 

Example 2: Tissue Distribution of Zsia51 
35 Blots of human RNA (Human Multiple Tissue Northem Blots I, 

II, and III; and Human RNA Master Blot; Clontech Laboratories, Inc., Palo 
Alto, CA) were probed to determine the tissue distribution of zsig51. A cDNA 
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probe was generated using 20 pmoles each of oligonucleotide primers 
ZC16.013 (SEQ ID NO:25) and ZC16.014 (SEQ ID NO:26) and 5 jil of a 
pancreas cDNA library (prepared from pancreas RNA using a commercially 
available kit (Marathon™ cDNA Amplification Kit from Clontech Laboratories, 
5 Inc.) and diluted 1:100). The probe was generated by PGR. incubating the 
reaction mixture at 94X for 1 minute followed by 30 cycles of 94X, 20 
seconds; 30 seconds; 72'C, 30 seconds; then a final extension for 7 
minutes at 72**C. Reaction products were electrophoresed on a 1.25% Tris- 
borate/EDTA gel, and the 180 bp product was excised from the gel. DMA 

10 was extracted from the gel slab using a commercially available kit 
(QIAquick™ Gel Extraction Kit; Qiagen, Inc., Santa Clarita. CA). This PGR 
product was sequenced and found to be a portion of the zsigSI sequence. 
98.7 ng of the extracted fragment was labeled with ^^P (Multiprime™ DNA 
Labeling System; Amersham Gorporation. Arlington Heights, IL). 

15 Unincorporated radioactivity was removed by column chromatography using 
a commercially available push column (NucTrap® column; Stratagene 
Gloning Systems. La Jolla, GA; see U.S. Patent No. 5,3.36,412). The blots 
were prehybridized for 3 hours at 65X in a hybridization solution 
(ExpressHyb™ Hybridization Solution; Glontech Laboratories. Inc.) containing 

20 1 mg of boiled salmon spenn DNA. 10.2 x 10® cpm of probe and 1 mg of 
salmon sperm DNA were boiled for 5 minutes, iced, mixed with 10 ml of 
ExpressHyb*^ solution, and added to the blots. The blots were incubated in 
the solution overnight at 65X. Initial washes were done for 40 minutes in 2 x 
SSG. 0.1% SDS at room temperature, followed by a 40-minute wash at 50**G 

25 in 0.1 X SSG. 0.1% SDS with one change of wash solution. The washed 
blots were exposed to film at -80**G overnight, then washed again with 0.1 x 
SSG. 0.1% SDS at 65X for 40 minutes to remove background and exposed 
ovemight at -80°G. High expression was seen in pancreas, with lower 
expression in pituitary gland. The transcript size was approximately 1 kb. 

30 Additional Northern blotting experiments were canied . out 

essentially as disclosed above using a longer human zsigSI probe. Similar 
expression in pancreas and pituitary was seen, and a low-abundance 
transcript of approximately 4 kb was seen in human testis. 

35 Example 3: Ghromosomal MaopinQ 

Human zsigSI was mapped to chromosome 11 using the 
commercially available GeneBridge 4 Radiation Hybrid Panel (Research 
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transformant The transformant was streaked on an LB plate containing 175 
^ig/ml annpicillin and 25 ^g/mi methiciilin and incubated overnight at 37**C. 
The cDNA insert was sequenced and found to contain an open reading frame 
encoding 128 amino acids, including a putative 20 amino acid secretory 
5 peptide. The DNA and amino acid sequences are shown in SEQ ID N0:31 
and NO:32, respectively. 

Example 5: Chromosomal Mapping of Mouse Zsiq51 Gene 

The zsig51 gene was mapped in mouse using the 

10 commercially available mouse T31 whole genome radiation hybrid (WGRH) 
panel (Research Genetics, Inc., Huntsville, AL) and Map Manager QT linkage 
analysis program. The T31 WGRH panel contains PCRable DNAs from each 
of 100 radiation hybrid clones, plus two control DNAs (the 129aa donor and 
the A23 recipient). For mapping, 20-pl reactions were run essentially as 

15 disclosed in Example 3 using sense primer ZC18,588 (SEQ ID NO:40). 5* 
CCG TTT CTC CCG CTA CTA 3'. and antisense primer ZC18.589 (SEQ ID 
N0:41), 5' GGG CCA ACC TCA TCT TCA 3\ The PCR cycler conditions 
were as follows: an initial 5-minute denaturation at 94*'C; 35 cycles of 1 
minute denaturation at 94**C, 1 minute annealing at 66X, and 90 seconds 

20 extension at 72^; followed by a final extension of 7 minutes at 72X. The 
reaction products were separated by electrophoresis on a 2% agarose gel 
(Life Technologies, Gaithersburg. MD). 

At P = 0.0001, mouse zsigSI linked to the maricer D19Mit68 
with a LCD score of 8.2. D19Mit68 has been mapped at 6.0 cM on mouse 

25 chromosome 19. This is a known region of synteny or linkage conservation 
with the region of human chromosome 11 where the human fomi of zsig51 
has been mapped. 

Examples: RatZsia51 cDNA 

30 RNA extracted from rat pancreatic islet cells was reverse 

transcribed. The first strand cDNA reaction contained 16 nl of rat pancreatic 
islet cell poly d(D-selected poly (A)**" mRNA at a concentration of 0.4 |ig/ml. 
and 2 ^1 of 20 pmole/^il first strand primer (ZC6172; SEQ ID NO:33) 
containing an Xho I restriction site. The mixture was heated at 70*'C for 3 

35 minutes and cooled by chilling on ice. First strand cDNA synthesis was 
initiated by the addition of 8 jil of first strand buffer (5x SUPERSCRIPF" 
buffer; Life Technologies, Gaithersburg, MD), 4 iil of 100 mM dithiothreitol. 
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and 2 [il of a deoxynucleotide triphosphate (dNTP) solution containing 10 mM 
each of dTTP. dATP. dGTP and 5-methyklCTP (Pharmacia LKB 
Biotechnology, Piscataway. NJ) to the RNA-primer mixture. The reaction 
mixture was incubated at 45''C for 2 minutes, followed by the addition of 10 ^l 
5 of 200 Will RNase H" reverse transcriptase (SUPERSCRIPT II®; Life 
Technologies). The efficiency of the first strand synthesis was analyzed in a 
parallel reaction by the addition of 10 ^Ci of 32p^xdCTP to a 5 ^il aliquot from 
one of Vne reaction mixtures to label the reaction for analysis. The reaction 
mixtures were incubated at 45»C for 50 minutes, then at SO'C for 10 minutes, 
id Unincorporated 32p^clCTP in the labeled reaction was removed by 
chromatography on a 400 pore size gel filti-ation column (Clontech 
Laboratories, Inc.). The unincorporated nucleotides and primers in the 
unlabeled first stirand reactions were removed by chromatography on a 400 
pore size gel filti-ation column. The length of labeled first sti^nd cDNA was 
15 determined by agarose gel electrophoresis. 

The second sb^and reaction contained 100 ^1 of tiie unlabeled 
first strand cDNA, 30 ^1 of 5x polymerase I buffer (125. mM Tris-HCI. pH 7.5. 
500 mM KCI, 25 mM MgCl2. 50mM (NH4)2S04)), 2.0 jil of 100 mM 
dithiothreitol, 3.0 jil of a solution containing 10 mM of each deoxynucleotide 
20 triphosphate, 7 nl of 5 mM p-NAD, 2.0 ^1 of 10 U/^l £ coli DNA ligase (New 
England Biolabs), 6 ^il of 10 U/^l E. coli DNA polymerase I (New England 
Biolabs, Beveriy, MA), and 1.5 jil of 2 U/^il RNase H (Life Technologies). A 
1 0-jil aliquot from one of the second strand synthesis reactions was labeled 
by the addition of 10 ^Ci 32p.aciCTP to monitor tine efficiency of second 
25 strand synthesis. The reactions were incubated at 16**C for two hours, 
followed by the addition of 1 nl of a 10 mM dNTP solution and 6.0 ^i T4 DNA 
polymerase (10 U/jal, Boehringer Mannheim, Indianapolis. IN) and incubated 
for an additional 10 minutes at IS^C. Unincorporated 32p^(jcTP in the 
reaction mixture was removed by chromatography through a 400 pore size 
30 gel filtration column before analysis by agarose gel electrophoresis. The 
reaction was tenninated by tiie addition of 5 ^il 0.5 M EDTA and extraction 
with phenol/chloroform and chloroform followed by ethanol precipitation in the 
presence of 3.0 M Na acetate and 2 nl of a dye-labeled carrier (Pellet Paint™ 
Co-Precipitant; Novagen, Madison, Wl). The yield of cDNA was estimated to 
35 be approximately 2 ^g from starting mRNA template of 1 0 jig. 

Eco Rl adapters were ligated onto the 5' ends of the cDNA 
described above to enable cloning into an expression vector. A 10 jal aliquot 
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precipitated, washed in 70% ethanol. air-dried and resuspended in 40 (il 
water 

Following recovery from the low-melt agarose gel, the cDNA 
was cloned into the Eco Rl and Xho I sites of a phagemid vector 
5 (pBluescript® II SK(+); Stratagene. La Jolla, CA) and eiectroporated into 
DH10B cells, 

1.5 million pfu from the rat pancreas cDNA library were plated 
onto 150 mm NZY plates at a density of 40,000 pfu/plate on electroporation- 
competent £ coli cells (XLI-Blue MRF strain; Stratagene). Following 

10 incubation at Zl^'C overnight, filter lifts were made using nylon membranes 
(Hybond-N™; Amersham Corporation, Arlington Heights, IL). according to the 
procedures provided by the manufacturer. The filters were processed by 
denaturation in a solution containing 1.5 M NaCI and 0.5 M NaOH for 7 
minutes at room temperature. The filters were neutralized in 0.5 M TrisiHCI. 

15 pH 7.2 for 7 minutes. Phage DNA was fixed onto the filters with 1,200 
fxJoules of UV energy in a UV cross-linker (Stratagene). The filters were then 
washed with 0.25X SSC at 70^'C to remove excess cellular debris. 

A probe was generated by PGR using oligonucleotide primers 
ZC16763 (SEQ ID NO:34) and ZC16753 (SEQ ID NO:35) and plasmid 

20 pSLSIG51-1 as a template. One \i\ of a 1:100 dilution of the plasmid prep 
used to sequence pSLSIG61-1 was combined with 1 ^1 of 20 pmole/}il 
ZC16763, 1 ^l of 20 pmole/^l ZC16753, and 45 ^l of a mixture of Taq DNA 
polymerase, salts, magnesium, and deoxynucleotide triphosphates (PGR 
Supermix; Life Technologies, Gaithersburg, MD). The amplification was 

25 candied out at 94''C for 30 seconds followed by 35 cycles of 20 seconds at 
94**G, 20 seconds at 55°G, and 1 minute at 68°C; followed by a final 
incubation at 68°G for 5 minutes. The probe was purified by gel 
electrophoresis. 

The filters were pre-washed six times for 30 minutes in hot 
3 0 0.25X SSG, 0.25% SDS, prehybridized overnight at 60"G in the same 
solution containing a 1/200 dilution of denatured hening sperm DNA, then 
hybridized to the probe over the weekend at 60°C in hybridization solution 
(containing, per liter: 250 ml 20X SSC (0.45^ filtered). 50 ml 100X 
Denhardfs (5 Prime-3 Prime. Boulder, GO), 2 ml 0.5 M EDTA, 20 ml 10% 
35 SDS (Research Genetics. Huntsville, AL), and water (Baxter. McGaw Pari<, 
IL) containing 20 y^/vn\ sheared DNA (Research Genetics) denatured at 98°G 
for 10 minute and -7.5 ^1 probe denatured at 98°G for 4 minutes. Positives 
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Example 7: ZsioSI Adenovirus V ctor Construction 

The protein coding regions of human and mouse zsigSI were 
amplified by PGR using primers that added Fsel and AscI restriction sties at 
the 5* and 3' tennini, respectively. PGR primers ZC17438 (SEQ ID NO:42) 
and ZC 17439 (SEQ ID NO:43) were used with template plasmid containing 
the full-length human zsig51 cDNA. PGR primers ZC17950 (SEQ ID N0:44) 
and ZG17951 (SEQ ID NO:45) were used with template plasmid containing 
the full-length mouse zsig51 cDNA in a PGR reaction as follows; gS'^C for 5 
minutes; 15 cycles at 95°G for 1 min., 58°G for 1 min.. and 72°G for 1.5 min.; 
72*'G for 7 min.; followed by a 4^G soak. The PGR reaction products were 
loaded onto a 1.2% (low melt) agarose (SeaPlaque GTG; FMG, Rockland, 
ME) gel in TAE buffer. The 2sig51 PGR products were excised from the gel 
and purified using a spin column containing a silica gel membrane 
(QIAquick^ Gel Extraction Kit; Qiagen, Inc., Valencia, GA). The PGR 
products were then digested with Fsel and Asci. phenol/chloroform extracted, 
EtOH precipitated, and rehydrated in 20^1 TE (Tris/EDTA pH 8). The 390bp 
(human) and 387bp (mouse) 2sig51 fragments were, then ligated into the 
Fsel-AscI sites of the vector pMT12-8 (Example 10. below) and transformed 
into DH10B competent cells by electroporation. Glones containing 2sig51 
inserts were identified by plasmid DNA miniprep followed by digestion with 
Fsel and AscI. A positive clone was sequenced to insure that there were no 
deletions or other anomalies in the construct. DNA was prepared using a 
commercially available kit (obtained from Qiagen, Inc.) 

The zsig51 cDNA was released from the pTG12-8 vector using 
Fsel and AscI enzymes. The cDNA was isolated on a 1% low melt agarose 
gel and excised from the gel, and the gel slice was melted at 70*'G, extracted 
twice with an equal volume of Tris-buffered phenol, and EtOH precipitated. 
The DNA was resuspended in 10^1 H2O. The cDNA was ligated into 
pAGGMV shuttle vector (Microbix Biosystems, Inc. Ontario, Canada) in which 
the polylinker had been modified to include Fsel and AscI sites and 
transfonned into E coli host cells (Electromax DHIOB"^ cells; obtained from 
Life Technologies, Inc., Gaithersburg, MD) by electroporation. Glones 
containing zsigSI inserts were identified by plasmid DNA miniprep followed 
by digestion with Fsel and AscI. A large-scale preparation of DNA was made 
for transfection. 

The zsig51-containing shuttle vectors were co-transfected with 
E1 -deleted, adenovirus vector pJM17 (Microbix Biosystems, Inc.) into 293A 
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cells (Quantum Biotechnologies, Inc. Montreal, QC. Canada) that express the 
adenovirus E1 gene. The DMA was diluted up to a total volume of 50^1 with 
sterile HBS (150mM NaCI, 20mM HEPES). In a separate tube, 20 ^l DOTAP 
(Boehringer Mannheim, Img/ml) was diluted to a total volume of 100^] with 
HBS. The DNA was added to the DOTAP, mixed gently by pipeting up and 
down, and left at room temperature for 15 minutes. The media was removed 
from the 293A cells and washed with 5 ml seojm-free MEMalpha containing 
1mM sodium pyruvate, 0.1 mM MEM non-essential amino acids and 25mM 
HEPES buffer (all from Life Technologies, Inc.). 5 ml of serum-free MEM was 
added to the 293A cells and held at 37°C. The DNA/lipid mixture was added 
drop-wise to the T25 flask of 293A cells, mixed gently, and incubated at 
37*^0 for 4 hours. After 4 hours the media containing the DNA/lipid mixture 
was aspirated off and replaced with 5 ml complete MEM containing 5% fetal 
bovine serum. Vims propagation is conditional and is achieved only by 
growing the El-deleted virus in a cell line expressing the E1 gene. 

Ceils were maintained for 2-4 weeks until the recombination 
event occurred. (Recombinant virus is generated by homologous 
recombination of overlapping fragments of the viral genome in the pJMt7 
vector and the shuttle vector.) At that time, the host 293 cells were tysed by 
the virus, fomiing plaques of dead cells. Within 3-5 days the entire 
monolayer was completely lysed. The medium containing the viral lysate 
was collected and any remaining intact cells were iysed by repeated 
freeze/thaw cycles and the cell debris pelleted by centrifugation. 

The viral lysate was then plaque-purified according to the 
method of Becker et al.. Meth. Cell Biol. 43:161-189, 1994. Briefly, serial 
dilutions were prepared in DMEM containing 10% fetal bovine serum and 100 
U/ml penicillin/streptomycin, plated on to monolayers of 293 cells, and 
incubated at 37*^0 for one hour. A melted 1.3% agarose/water solution was 
mixed with 2X DMEM (containing 4% FBS, 200 U/ml penicillin/streptomycin, 
0.5 |ig/ml fungizone and 30 mg/ml phenol red), and 6 ml of the mixture was 
added to the virus-infected 293 cells followed by incubation at 37*'C until 
plaques fonned, 7-10 days. Single plaques were isolated, and the presence 
of the zsigSI insert was verified by PGR. The primers were ZC12700 (SEQ 
ID NO:48) and ZC12742 (SEQ ID NO:49). Amplification was carried out over 
30 cycles of 94°C, 0.5 minute, 55°C, 0.5 minute, and 72*^0, 0.5 minute; 
followed by a 10-minute extension at 72°C. One plaque each for human and 
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mouse 2sig51 that had the expected size PGR product was used to do a 
primary amplification. 

Ten 10-cm plates of nearly confluent (80-90%) 293A cells were set up 
20 hours previously. Roughly 5% of the virus lysate from a plaque was 
5 added to each 10-cm plate and monitored for 48 to 72 hours looking for CPE 
(Cytopathic Effect) under the white light microscope. When all of the 293A 
cells showed CPE. this 1° stock lysate was collected and 3 freeze/thaw 
cycles performed. 

For secondary (2'*) amplification of zsig51 rAdV, 20 15-cm 

10 tissue culture dishes of 293A cells were prepared so that the cells were 80- 
90% confluent. All but 20 mi of 5% MEM media was removed, and each dish 
was inoculated with 300-500 ml of the I*' amplified rAdv lysate. After 48 
hours the 293 A cells were lysed from virus production, and the lysate was 
collected into 250-ml polypropylene centrifuge bottles. 

15 To purify the recombinant virus, NP-40 detergent was added to 

a final concentration of 0.5% to the bottles of coide lysate to lyse all cells. 
Bottles were placed on a rotating platform for 10 min. agitating as fast as 
possible without the bottles falling over. The debris was pelleted by 
centrlfugation at 20.000 X G for 15 minutes. The supernatant was 

20 transferred to 250-ml polycarbonate centrifuge bottles, and 0.5 volumes of 
20% PEG8000/2.5M NaCI solution was added. The bottles were shaken 
overnight on ice. The bottles were centrifuged at 20,000 X G for 15 minutes, 
and supernatant was discarded into a bleach solution. Using a sterile cell 
scraper, the precipitate from 2 bottles was resuspended in 2.5 ml PBS. The 

2 5 resulting virus solution was placed in 2-mi microcentrifuge tubes and 

centrifuged at 14,000 X G for 10 minutes to remove any additional cell debris. 
The supernatant from the 2-ml microcentrifuge tubes was transfen'ed into a 
15-ml polypropylene snapcap tube and adjusted to a density of 1.34 g/ml with 
cesium chloride (CsCI). The volume of the virus solution was estimated, and 

3 0 0.55 g/ml of CsCI was added. The CsCI was dissolved, and 1 mi of this 

solution weighed 1.34 g. The solution was transferred to polycariDonate thick- 
walled centrifuge tubes 3.2ml (Beckman) and spun at (348,000 X G) for 3-4 
hours at 25°C. The virus formed a white band. Using wide-bore pipette tips, 
the virus band was collected. 
35 The virus from the gradient had a large amount of CsCI which 

was removed before it could be used on cells. Phannacia PD-10 columns 
prepacked with Sephadex G-25M (Pharmacia) were used to desalt the virus 
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preparation. The column was equilibrated with 20 ml of PBS, The virus was 
loaded and allowed to run into the column. 5 ml of PBS was added to the 
column, and fractions of 8-10 drops were collected. The optical densities of 
1:50 dilutions of each fraction were determined at 260 nm on a 
5 spectrophotometer. A clear absorbance peak was present between fractions 
7-12. These fractions were pooled, and the optical density (OD) of a 1:50 
dilution was detemiined. OD was converted to virus concentration using the 
formula (OD at 260nm)(50)(1.1 x 10^^) = virions/ml. The human zsigSI rAdV 
concentration was 6.1 X 10^^ virions/ml. The mouse 2sig51 virus 
10 concentration was 9.2 X 10^^. 

To store the virus, glycerol was added to the purified virus to a 
final concentration of 15%, mixed gently but effectively, and stored in aliquots 
at -80«C. 

A protocol developed by Quantum Biotechnologies, Inc. 

15 (Montreal, Qc. Canada) was followed to measure recombinant virus 
infectivity. Briefly, two 96-well tissue culture plates were seeded with 1X10^ 
293A cells per well in MEM containing 2% fetal bovine serum for each 
recombinant virus to be assayed. After 24 hours, 10-fold dilutions of each 
virus from 1X10'^ to 1X10'^"* were made in MEM containing 2% fetal bovine 

20 serum. lOOial of each dilution was placed in each of 20 wells. After 5 days at 
37^*0, wells were read either positive or negative for CPE, and a value for 
"Plaque Fonming Units/ml" (PFU) ws calculated. 

TCID50 fonmulation used was as per Quantum Biotechnologies, 
Inc., above. The titer (T) was determined from a plate where virus used was 

25 diluted from 10"^ to 10'^^, and read 5 days after the infection. At each dilution 
a ratio (R) of positive wells for CPE per the total number of wells was 
detemriined. 

Example 8: Adenovirus Administration of Zsiq51 to Normal Mice 
30 Human and mouse zsig51s were administered to mice using 

adenovirus vectors containing the coding region of the human gene (zsigSlh; 

SEQ ID N0:1) or its mouse orthologue (2sig51m; SEQ ID N0:31). The 

adenovirus vectors were injected intravenously into C57BI/6 mice according 

to the experimental design shown in Table 5. Blood was drawn and analyzed^ 
35 on day 12 and then again on day 21 of each experiment. All mice received 

bromodeoxyuridin (BrdU) in their drinking water three days before sacrifice. 

Animals were sacrificed on day 21. Parameters measured included weight 
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change, complete blood counts, serum chemistries, histology, organ weights, 
and cell proliferation measured by BrdU incorporation. 



Table 5 

5 

Group 1 2sig51 rAdV 

1 X 10" particles/dose 
10 females. 10 males 

1 0 Group 2 null AdV control 

1x10" particles/dose 
10 females, 10 males 



Group 3 no treatment 
15 5 females, 5 males 

Female glucose levels were higher in zsigSlh experimental 
groups than in null vims control groups. Both male and female glucose levels 
were lower in zsigSIm experimental groups than in null virus control groups. 
20 This lowered glucose was seen in zsigSIm mice both in a fasting state and in 
a well-fed state. 

The liver weights of both males and females in zsigSIm 
experimental groups were higher than in null virus control groups. 

25 ^ Example 9: In Situ Hvbridization 

Fresh tissue was fixed in 4% parafonnaldehyde at 4°C 
overnight. Tissue was embedded in paraffin using a standard protocol with 
the exception that Histoclear (National Diagnostics, Atlanta, GA) was 
substituted for xylene. 5-10 micron sections were mounted onto slides 

3 0 (Superfrost™ Plus; VWR Scientific, West Chester. PA), and the slides were 
baked at 37X for 4 hours, then stored at 4'C. Alternatively, fixed and 
sectioned tissues were obtained from commercial sources. The slides were 
de-waxed in Histoclear and rehydrated through an ethanol series. They were 
then air-dried and stored at -20**C. 

35 For in situ hybridization, slides were allowed to come to room 

temperature, washed 3 times for 5 minut s each in phosphate-buffered saline 
containing 0.1% polyoxyethylenesorbitan monolaurate (Tween 20) (PBT), 
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then incubated at 37'C in PBT with proteinase K at a concentration of 2-100 
ng/ml. Slid s were rinsed twice for 5 minutes in PBT, fixed again in 4% 
paraformaldehyde for 20 minutes at room temperature, and rinsed twice for 5 
minutes in PBT, Sections were acetylated by dipping the slides into a 
5 mixture of 197 ml deionized water, 2.6 ml triethanolamine, and 350 ^l 
hydrochloric add. 500 \xl of acetic anhydride was added drop by drop while 
stirring, and the incubation of the slides was continued in this mixture at room 
temperature for 10 minutes. Slides were rinsed twice for 5 minutes in PBT. 
then dehydrated through a methanol series and air-dried. 

10 The sections were hybridized with an in wfro-transcribed 

digoxigenin-iabeled RNA antisense zsig51 probe representing the complete 
protein coding region of the gene. Human tissues were probed with an 
antisense to tlie human gene (SEQ ID N0:2 human coding region only) and 
mouse tissues were probed with an antisense to the mouse orthologue (SEQ 

15 ID NO:32 mouse coding region only). The probes were used at a 
concentration of 200 ng to 1 ^igVml in 50% formamide. 10% dextran sulphate, 
5x SSC, 5x Denhardt's solution, 250 pg/ml yeast tRNA,-500 ^ig/ml salmon 
sperm DMA, and 50 iiig/ml heparin. Hybridization was done ovemight at 60- 
72*'C. The slides were washed, using solutions preheated at 60-72*^0, in 

2 0 50% fomiamide, 2X SSC, once for 15 minutes; then in a fresh wash for 30 
minutes; and finally in 25% formamide, 1X SSC, 0.5X PBS for 30 minutes. 
Slides were then rinsed twice in PBT at room temperature for 5 minutes. 

The sections were blocked in 5% nonfat dried milk. 4x SSC, 
and 0.1% Tween 20 (blocking solution) with 5% nomial rabbit or goat serum 

25 added for 1 hour at room temperature. Slides were then rinsed in PBT at 
room temperature three times for 5 minutes. Sheep anti-digoxigenin antibody 
(Boehringer Mannheim, Indianapolis, IN) was diluted 1:1000 in blocking 
solution, added to slides, and incubated for 30 minutes at room temperature. 
Slides were washed four times for 15 minutes each in blocking solution. 

30 Biotinylated rabbit-anti-sheep antibody (Vector Laboratories, Buriingame, CA) 
was diluted 1:200 in blocking solution with 7.5% nomial mouse serum added, 
and allowed to incubate at room temperature at least 30 minutes before 
being added to slides and incubated for an additional 30 minutes at room 
temperature. Slides were then washed two times for 5 minutes each in 

35 blocking solution, Avidin and biotinylated peroxidase (Vectastain® ABC-AP 
Kit; Vector Laboratories) w re prepared according to the manufacturer's 
instructions before being added to slides, which were tiien incubated for 30 



wo 99/41377 



PCTAJS99/03104 



60 

Fsel and AscI (Boerhinger Mannheim), ethanol precipitated, and ligated into 
PMT12-8 that was previously digested with Fsel and Asci. The pMT12-8 
plasmid, designed for expression of a gene of interest in transgenic mice, 
contains an expression cassette flanked by 10 kb of MT-1 5' DMA and 7 kb of 
5 MT-1 3' DNA. The expression cassette comprises the mouse MT-1 
promoter, a rat insulin II intron. a polylinker for the insertion of the desired 
clone, and the human growth honmone poly A sequence. 

About one microliter of each of the ligation mixtures was 
electroporated into £ coli host cells (Electromax DHIOB^^* cells; obtained 

10 from Life Technologies, Inc.. Gaithersburg, MD) according to supplier's 
directions and plated onto LB plates containing 100 jig/ml ampicillin. and 
incubated ovemight. Colonies were picked and grown in LB medium 
containing 100 jig/ml ampicillin. Miniprep DNA was prepared from the picked 
clones and screened for the zsigSI human or mouse insert by restriction 

15 digestion with EcoRI, and subsequent agarose gel electrophoresis. 
.Maxipreps of the correct pMT-zsig51 constructs were prepared. A Sail 
fragment containing with 5' and 3* flanking sequences, the MT-1 promoter, 
the rat insulin 11 intron, zsigSI human or mouse cDNA, and the human growth 
hormone poly A sequence was prepared and used for microinjection into 

20 fertilized mouse oocytes to generate transgenic founder animals. The 
original oocytes and the sperm used to fertilize them were from F1 hybrids of 
C3H and C57Bi/6 mice. In subsequent generations, the mice were mated at 
every generation to C57BI/6 mice. The human transgene was designated 
MTzsig51h, while the mouse transgene was designated MTzsigSlm. 

25 Five male and five female transgenic mice carrying MTzsigSlh 

were identified. The expression level of the transgene in the liver of each 
mouse was quantified by real-time RT-PCR on an ABI Prism 7700 Sequence 
Detector {Peri<in-Elmer). This analysis indicated that none of the males had 
a measurable level of expression. The expression profile of the females was: 

3 0 2 very high expressors (greater than 10,000 mRNA molecules/liver cell); 1 
high expressor (greater than 2,000 mRNA molecules/liver cell); 1 medium 
expressor (approximately 600 mRNA molecules/liver cell); and 1 low 
expressor (approximately 300 mRNA molecules/liver cell). 

Two male and ten female transgenic mice carrying MTzsigSlm 

35 were identified. The expression level of the transgen in the liver of each 
mouse was quantifi d as above. Neither of the males had a measurable 
level of expression. The expression profile of the females was: 5 very high 
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expressors (greater than 10,000 mRNA molecules/liver cell); 1 high 
expressor (greater than 2.000 nfiRNA molecules/liver ceil); 1 medium 
expressor (approximately 1.100 mRNA molecules/liver cell); 1 low expressor 
(approximately 300 mRNA molecules/liver cell); and 1 below the measurable 
5 level of expression. 

Parameters measured included weight change, fertility, 
complete blood counts, serum chemistries, histology, organ weights and cell 
proliferation measured by incorporation of bromodeoxyuridine (BrdU). All 
mice received BrdU in their drinking water three days before sacrifice. 
10 For the MTzsigSI h transgenics, male animals were sacrificed at 

approximately two months of age. while females were sacrificed at 
approximately 5 months, and tissues were collected for histological analysis. 
Tissue samples were fixed in 10% buffered fonnalin. embedded in paraffin, 
sectioned at 3 microns, and stained with hematoxylin and eosin. The slides 
15 were examined and scored by a board-certified veterinary pathologist. 
Female mice were found to have scattered small areas of mineralization in 
the heart, lung, skeletal muscle, and the growth plate of the femur. 

In the second generation of offspring that arose from matings to 
C57BI/6 mice, the highest-expressing line of MTzsig51h mice gave rise to 
2 0 transgenic females that displayed severe hair loss at weaning. 

MTzsig51M animals were sacrificed at approximately six 
months of age. Tissues were collected for histological analysis, fixed, 
examined, and scored as above. The highest expressing female MTzsigSI m 
mouse was infertile. 

25 • 

Example 11: Baculovirus expression 

C-terminal Glu-Glu tagged human zsigST is expressed in Sf9 
cells using a baculovims vector. The tagged protein comprises the peptide 
tag shown in (SEQ ID NO:46). The zsigSlcee sequence (as a EcoRI-BamHI 

30 fragment) is inserted into a modified pFastBac™ expression vector (Life 
Technologies) containing the late activating Basic Protein promoter. About 
90 nanograms of the zsigSlcee insert and about ISO ng of the digested 
vector are ligated overnight. The ligation mix is diluted 3-fold in TE (10 mM 
Tris-HCI, pH 7.S and 1 mM EDTA), and 4 fmol of the diluted ligation mix is 

35 transformed into competent E. coli cells (Library Efficiency DHSa™ 
competent cells; Life Technologies. Gaithersburg, MD) according to the 
manufacturer's directions by heat shock for 4S seconds in a 42*^0 waterbath. 
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The ligated DNA is diluted in 450 ^1 of SOC media (2% Bacto Tryptone, 0.5% 
Bacto Yeast Extract, 10 ml 1M NaCI. 1.5 mM KCI. 10 mM MgCI,. 10 mM 
MgS04 and 20 mM glucose) and plated onto LB plates containing 100 ng/ml 
ampicillin. Clones are analyzed by restriction digests, and 1 ^1 of a positive 
5 clone is transformed into 20 ^xl £ coli cells (Max Efficiency DHIOBac™ 
competent cells; Life Technologies) according to the manufacturer's 
instructions, by heat shock for 45 seconds in a 42*C waterbath. The ligated 
DNA is diluted in 980 ^1 SOC media and plated onto Luria Agar plates 
containing 50 ^lg/ml kanamycin, 7 iig/ml gentamycin, 10 ^g/ml tetracycline, 
10 IPTG and Bluo Gal. The cells are incubated for 48 hours at ST'^C. A color 
selection is used to identify those cells having virus that has incorporated into 
the plasmid (referred to as a "bacmid"). Those colonies, which are white in 
color, are picked for analysis. Bacmid DNA is isolated from positive colonies 
using commercially available reagents and equipment (QIAprep® 8 Miniprep 
15 Kit and QIAvac vacuum manifold; Qiagen, Inc.. Valencia, CA) according the 
manufacturer's directions. Clones are screened for the correct insert by 
amplifying DNA using primers to the Basic Protein promoter and to the SV40 
terminus via PCR. Those having the con-ect insert are used to transfect 
Spodoptera frugiperda (Sf9) cells. 
20 Sf9 cells are seeded at 5 x 10^ cells per 35 mm plate and 

allowed to attach for 1 hour at 27**C. Five microliters of bacmid DNA is 
diluted with 100 ^1 serum-free media (Sf-900 II SFM; Life Technologies). Six 
jil of a 1:1.5 (M/M) liposome formulation of the cationic lipid N, N*. N", N"'- 
tetmiethyl-N, N', N", N"Ltetrapalmitylspermine and dloleoyl 
25 phosphatidylethanolamine in membrane-filtered water (CellFECTIN™ 
reagent; Life Technologies) is diluted with 100 ^1 Sf-900 II SFM. The bacmid 
DNA and lipid solutions are gently mixed and incubated 30-45 minutes at 
room temperature. The media from one plate of cells are aspirated, the cells 
are washed IX with 2 ml fresh media. Eight hundred microliters of Sf-900 II 
30 SFM is added to the lipid-DNA mixture. The wash media is aspirated and the 
DNA-lipid mix added to the cells. The cells are incubated at 27**C for 4-5 
hours. The DNA-lipid mix is aspirated and 2 ml of Sf-900 II media containing 
penicillin/streptomycin is added to each plate. The plates are incubated at 
27*C, 90% humidity, for 96 after which the virus is harvested. 
3 5 Sf9 cells are grown in 50 ml Sf-900 II SFM in a 200 ml shake 

flask to an approximate density of 0.41-0.52 x 10® cells/ml. They are then 
transfected with 100 |il of the virus stock from above and incubated at 27^C 
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for 2-3 days aft r which time th virus is harvested. The titer for AcCSCFI is 
1.7 X 10^ pfu/ml and for AcCSNFI it is 2.6 x 10^ To scale up. 1.5 x 10® SF9 
cells/ml are added to five liters of SF 900 II SFM and grown for 91 hours. 
The cells are then transfected with the harvested virus (MOl 0.2) and 
5 incubated as above for 71 hours. 

Example 12: Protein Purification and Analvsis 

Glu-Glu tagged zsigSI protein (zsigSlcee) was produced in 
baculovirus infected cells essentially as disclosed above, and the protein was 

10 purified from cell-conditioned media by affinity chromatography. A 100 ml 
bed volume of immobilized protein G (protein G-Sepharose®; Pharmacia 
Biotech) was washed 3 times with 100 ml of PBS containing 0.02% sodium 
azide using a 500 ml Nalgene 0.45 micron filter unit. The gel was washed 
with 6.0 volumes of 200 mM triethanolamine. pH 8.2 (TEA; Sigma Chemical 

15 Co., St. Louis. MO), and an equal volume of anti-glu-glu antibody solution 
containing 900 mg of antibody was added. After an ovemight incubation at 
4*'C. unbound antibody was removed by washing the resin with 5 volumes of 
200 mM TEA as described above. The resin was resuspended in 2 volumes 
of TEA, transferred to a suitable container, and dimethylpimilimidate-2HCI 

20 (Pierce, Rockford, IL). dissolved in TEA, was added to a final concentration of 
36 mg/ml of gel. The gel was rocked at room temperature for 45 minutes, 
and the liquid was removed using the filter unit as described above. 
Nonspecific sites on the gel were then blocked by incubating for 10 minutes 
at room temperature with 5 volumes of 20 mM ethanolamine in 200 mM TEA. 

25 The gel was washed with 5 volumes of PBS containing 0.02% sodium azide 
and stored in this solution at 4''C. 

Unless othenA/ise noted, all purification steps were earned out at 
4'*C. A mixture of protease inhibitors was added to a 2 liter sample of 
conditioned media from baculovirus-infected Sf9 cells to final concentrations 

3 0 of 2.5 mM ethylenediaminetetraacetic acid (EDTA, Sigma Chemical Co.. St. 
Louis, MO), 0.001 mM leupeptin (Boehringer-Mannheim, Indianapolis, IN), 
0.001 mM pepstatin (Boehringer-Mannheim) and 0.4 mM 4-(2-Aminoethyl)- 
benzenesulfonyl fluoride hydrochloride (Pefabloc®; Boehringer-Mannheim). 
The sample was centrifuged at 10,000 rpm for 30 min at 4**C in a Beckman 

35 JLA-10.5 rotor (Beckman Instrum nts) in a Beckman Avanti J25I centrifuge 
(Beckman Instruments) to remove cell debris. To the supernatant fraction 
was added a 50.0 ml sample of anti-EE Sepharose, prepared as described 
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above, and the mixture was gently agitated on a Wheaton (MilMlle, NJ) roller 
culture apparatus for 18.0 h at 4**C. 

The mixture was poured into a 5.0 x 20.0 cm column (Econo- 
Column®; Bio-Rad Laboratories, Hercules. CA), and the gel was washed 
5 with 30 column volumes of phosphate buffered saline (PBS). The unretained 
flow-through fraction was discarded. Once the absorbance of the effluent at 
280 nM was less than 0.05, flow through the column was reduced to zero 
and the anti-EE Sepharose gel was washed with 2.0 column volumes of PBS 
containing 0.2 mg/ml of EE peptide having the sequence Glu-Tyr-Met-Pro- 
10 Val-Asp (SEQ ID NO:47) (AnaSpec, San Jose, CA). After 1,0 hour at 4°C, 
flow was resumed and the eluted protein was collected. This fraction was 
referred to as the peptide elution. The anti-EE Sepharose gel was washed 
with 2.0 column volumes of 0.1 M glycine, pH 2.5, and the glycine wash was 
collected separately. The pH of the glycine-eluted fraction was adjusted to 
15 7.0 by the addition of a small volume of 1 0X PBS and stored at 4**C. • 

The peptide elution was concentrated to 5.0 ml using a 5,000 
molecular weight cutoff membrane concentrator (MUlipore. Bedford, MA) 
according to the manufacturer's instructions. The concentrated peptide 
elution was separated from free peptide by chromatography on a 1.5 x 50 cm 
20 Sephadex® G-50 (Pharmacia Biotech) column equilibrated in PBS at a flow 
rate of 1.0 ml/min using a commercially available HPLC system 
(BioCad^/Sprint™ HPLC system; PerSeptive BioSystems, Framingham, 
MA). Two-ml fractions were collected and the absorbance at 280 nM was 
monitored. The first peak of material absorbing at 280 nM and eluting near 
25 the void volume of the column was collected. This material represented 
purified zsig51cee. The material was aliquoted and stored at -80*'C. 

The N-temiinal peptide of the recombinant zsig51cee was 
identified by mass spectrometry. Two fonns were found. The first had a 
mass of 2000.8. which indicated an N-terminal peptide with a pyro-glutamic 
30 acid instead of glutamine (residue 1 of SEQ ID N0:2). The second form had 
a mass of 3038, coaesponding to the first peptide with glycosylation 
consisting of 2HexNAc, 3Hex. and 1 deoxyhex. 

Example 13: Expression of ZsiQ51 in Mammalian Cells 
5 A mammalian expr ssion vector was constmcted with the 

dihyrofolate reductase g ne under control of the SV40 early promoter and 
SV40 polyadenylation site, and a cloning site to insert the gene of interest 
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und r control of the mouse MT-1 promoter and the hGH polyad nylation site. 
The expression vector was designated pZP-9 and was deposit d at the 
American Type Culture Collection, 12301 Parklawn Drive. Rockville, MD on 
February 20, 1998 under Accession Number 98668. To facilitate purification 
5 of the protein of interest the pZP-9 vector was modified by addition of the 
tPA leader sequence (U.S. Patent 5,641,655, incorporated herein by 
reference) and a GluGlu tag (SEQ ID NO:46) between the MT-1 promoter 
and hGH tenninator. Expression results in an N-terminally tagged fusion 
protein comprising the tPA leader. The N-temriinally tagged vector was 
0 designated pZP9NEE. 

BHK570 cells (ATCC CRL 10314) were plated on 10-cm tissue 
culture dishes and allowed to grow to approximately 50 to 70% confluency 
overnight. Unless othenvise specified the cells were handled using standard 
aseptic technique, grown at 37X. 5% CO2, in water-jacketed incubators 
(Model 3110; Fomia Scientific, Marietta, OH), and with the following media: 
DMEM (High Glucose, #11965-092; Life Technologies, Gaithersburg, MD), 
5% Fetal Bovine Serum (Hyclone, Logan, UT), 1% . L-Glutamine (JRH 
Biosciences, Lexena. KS), 1% sodium pyruvate (Life Technologies). The 
cells were transfected with plasmid pZP9/zsig51NEE. using a 3:1 (w/w) 
liposome fomiulation of the polycationic lipid 2,3-dioleyloxy-N- 
[2(spenninecarboxamido)ethyl]-N,N-dimethyl-1-propaniminium- 
trifiuoroacetate and the neutral lipid dioleoyi phosphatidylethanolamine in 
membrane-filtered water (Lipofectamine™ Reagent; Life Technologies), in a 
serum-free (SF) media fomiulation (DMEM, 10 mg/ml transfenin, 5 mg/ml 
insulin, 1 mg/ml fetuin, 1% L-Glutamine (JRH #59202-77P), 1% sodium 
pyruvate. 16 pg of plasmid was diluted in a 15-ml tube with 640 |jl SF media, 
and in a separate tube, 35 pi of Lipofectamine'^" was mixed with 605 pi of SF 
media. The Lipofectamine™ mix was added to the DNA mix and allowed to 
incubate approximately 30 minutes at room temperature. Five ml of SF 
medium was added to the DNA:Lipofectamine™ mixture. The cells were 
rinsed once with 5 ml of SF medium, aspirated, and the 
DNA:Lipofectamine'^ mixture was added. The cells were incubated at 37'C 
for five hours, then 6.4 ml of 10% FBS/DMEM medium was added to the 
plate. The plate was incubated at 37*'C ovemight, and the 
DNA;Lipofectamine™ mixture was replaced with fresh medium the next day. 
On day 2 after transfection. the cells were split into transfection media 
(standard media plus IpM methotrexate) in 150-mm plates at 1:50. 1:100, 
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and 1:200. The plates were re-fed at day 5 following transfection with fresh 
selection medium. Once resistant colonies reached 3-4 mm in diameter, a 
plate containing 70 to 150 colonies was selected for immunoassay. 

The plate of cells was rinsed with 10 ml SF media, then 5 ml SF 
5 media is added, followed by a nylon mesh, then a notched nitrocellulose filter, 
both pre-soaked in SF media. Matching alignment marks were made on the 
plate, and it was incubated for approximately 5 hours at 37'C. The filter and 
mesh were removed, and the cells were re-fed standard media plus penicillin- 
streptomycin-neomycin antibiotic mix (Life Technologies) and incubated at 

10 37**C. The filter was incubated with anti-GluGlu antibody conjugated to 
horseradish peroxidase, at 1:5000 dilution, in 2.5% non-fat dry milk Western 
A buffer for I hour at room temperature on a rotating shaker. The filter was 
washed three times at room temperature in PBS plus 0.01% Tween 20. 15 
minutes per wash. The filter was developed with commercially available 

15 reagents (ECL™ Western blotting detection reagents; Amersham Life 
Science Inc., Arlington Heights, IL) according to the manufacturer's 
directions and exposed to film (Hyperfilm^^ ECL,. Amersham) for 
approximately 5 minutes. The alignment marks on the filter were transferred 
to the film. 

20 The film was aligned with the marks on the plate of cells for 

selection of colonies with optimal signals. The colonies were circled and the 
media was removed from the plate. Sterile 3-mm cloning discs (PGM 
Scientific Corporation #62-6151-12) soaked in trypsin were placed on the 
colonies, then transferred to 200 pi of selection media in a 96-well dish. A 

25 series of seven, two-fold serial dilutions was carried out with the cells 
recovered from the disc. The cells were grown for one week at 37*'C. then 
expanded by selecting the well with the lowest dilution of cells at confluency 
for each clone for trypsinization and transferring ir to a 12 well dish containing 
selection media. 

30 The clones were expanded directly from the 12-well dish to two 

T75 flasks for each clone. One flask was maintained in selection media at 
37'*C. The other flask was used for Western blot analysis of the clones. The 
flask was first grown to confluency, then the medium was changed to SF. 
allowed to incubate for two days at 37*'C, harvested, and filtered at 0,22 pm. 

3 5 This flask is discarded after harvest. 

The conditioned medium was concentrated 10-fold by 
ultrafiltration, and analyzed by West rn blot. Three clones producing the 
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highest levels of protein were selected and samples wer frozen. The clones 
are pooled and transferred for large scale culture. 

Example 14: Expression analysis bv quantitative RT-PCR 
5 The expression level of zsigSI mRNA in a variety of human and 

mouse tissues was quantified by real-time PGR (RT-PCR) on an ABI Prism 
7700 Sequence Detector (Peri<in-Elmer. NonA/alk. CT) following the 
manufacturer's protocols. Total RNA was prepared from fresh mouse tissues 
using a commercially available kit (RNeasy® kit; Qiagen). Human total RNA 
10 samples were purchased from a variety of commercial sources (Invitrogen, 
Clontech). 

Expression of zsig51 mRNA in the pancreas of diabetic NOD 
mice was found to be significantly higher than in the pancreas of non-diabetic 
NOD mice. Expression of zsig51 in the pancreas of fasted mice was found to 
15 be lower than in the pancreas of well-fed mice. Expression of zsig51 in the 
mouse eye was found to be approximately 50-fold higher than expression in 
normal pancreas. 

From the foregoing, it will be appreciated that, although specific 
20 embodiments of the invention have been described herein for purposes of 
illustration, various modifications may be made without deviating from the 
spirit and scope of the invention. Accordingly, the invention is not limited 
except as by the appended claims. 
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CLAIMS 

We claim: 

1. An isolated polypeptide comprising at least 15 consecutive 
amino acid residues of SEQ ID N0:2. 

2. An isolated polypeptide that is at least 80% identical in amino 
acid sequence to residues 1 through 106 of SEQ ID N0:2, said polypeptide 
comprising cysteine residues at positions corresponding to residues 8. 34, 38, 66, 
96, and 98 of SEQ ID N0:2; a glycine residue at a position corresponding to residue 
36 of SEQ ID N0:2; and beta strand-like regions con-esponding to residues 9-17, 29- 
34, 38-43, 59-64. 67-71 , and 90-95 of SEQ ID N0:2. 

3. The isolated polypeptide according to claim 2 further comprising 
cysteine residues at positions con-esponding to residues 25, 65. 80, and 101 of SEQ 
ID N0:2. 

4. The isolated polypeptide according to claim 2 wherein amino 
acid residues con-esponding to residues 8, 11. 12, 14, 29, 30, 32, 34, 43, 44, 60, 63. 
64. 65. 71, 74, 80, 90, 91. 93. and 94 of SEQ ID N0:2 are Cys. His, Pro. Asn, His, 
Val, Gin, Cys, Phe. Pro, Thr. Ser. Gin. Cys. Leu. Val. Cys. He. Phe, Ala. and Arg, 
respectively; and an amino acid residue corresponding to residue 75 of SEQ ID 
NO:2 is Lys or Arg. 

5. The isolated polypeptide according to claim 2 wherein said 
polypeptide is at least 95% identical to residues 1 through 106 of SEQ ID N0:2. 

6. The isolated polypeptide according to claim 2 comprising 
residue 1 through residue 106 of SEQ ID N0:2 or residue 1 through residue 106 of 
SEQ ID NO:29. 

7. The isolated polypeptide according to claim 2. covalently linked 
to an affinity tag. 

8. The isolated polypeptide according to claim 2. covalently linked 
to an immunoglobulin constant region. 
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9. An isolated protein comprising a first polypeptide according to 
any of claims 2-8 complexed to a second polypeptide, wherein said protein 
modulates cell proliferation, differentiation, or metabolism. 

10. The isolated protein according to claim 9 wherein said protein is 
a heterodimer. 

11. The isolated protein according to claim 10 wherein said second 
polypeptide is a glycoprotein hormone common alpha subunit. 

12. The isolated protein according to claim 9 wherein said first 
polypeptide comprises residue 1 through residue 106 of SEQ ID N0:2 or residue 1 
through residue 106 of SEQ ID NO:29. 

13. The isolated protein according to claim 9 wherein said protein is 

a,homodimer. 

14. The isolated protein according to claim 13 wherein each of said 
first and second polypeptides comprises residue 1 through residue 106 of SEQ ID 
N0:2 or residue 1 through residue 106 of SEQ ID NO:29. 

15. An isolated polynucleotide encoding a polypeptide according to 
any of claims 2-8. 

16. The isolated polynucleotide according to claim 15 comprising a 
sequence of nucleotides as shown in SEQ ID N0:4 from nucleotide 70 through 
nucleotide 387 or SEQ ID NO:30 from nucleotide 70 through nucleotide 387. 

17. The isolated polynucleotide according to claim 18 comprising a 
sequence of nucleotides as shown in SEQ ID N0:1 from nucleotide 125 through 
nucleotide 442. 

18. The isolated polynucleotide according to claim 15 wherein said 
polynucleotide is from 318 to 1000 nucleotides in length. 
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obtaining a genetic sample from a patient; 

incubating th _ g netic sample with a polynucleotide comprising at least 
14 contiguous nucleotides of SEQ ID N0:1 or the complement of SEQ ID N0:1. 
under conditions wherein said polynucleotide will hybridize to complementary 
polynucleotide sequence, to produce a first reaction product; 

comparing said first reaction product to a control reaction product, 
wherein a difference between said first reaction product and said control reaction 
product is indicative of a genetic abnonnality in the patient. 

28. An oligonucleotide probe or primer comprising 14 contiguous 
nucleotides of a polynucleotide of SEQ ID N0:4 or a sequence complementary to 
SEQ ID N0:4. 
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AMENDED CLAIMS 

[received by the International Bureau on 12 July 1999 (12.07.99); 
original claims 17 and 20 amended; remaining claims unchanged (2 pages)] 

9. An isolated protein comprising a first polypeptid according to 
any of claims 2-8 complexed to a second polypeptide, wherein said protein 
modulates cell proliferation, differentiation, or metabolism. 

10. The isolated protein according to claim 9 wherein said protein is 
a heterodimer. 

11. The isolated protein according to claim 10 wherein said second 
polypeptide is a glycoprotein hormone common alpha subunit 

12. The isolated protein according to claim 9 wherein said first 
polypeptide comprises residue 1 through residue 106 of SEQ ID N0:2 or residue 1 
through residue 106 of SEQ ID NO:29. 

13. The isolated protein according to claim 9 wherein said protein is 

a homodimer. 

14. The isolated protein according to claim 13 wherein each of said 
first and second polypeptides comprises residue 1 through residue 106 of SEQ ID 
N0:2 or residue 1 through residue 106 of SEQ ID NO:29. 

15. An isolated polynucleotide encoding a polypeptide according to 
any of claims 2-8. 

16. The isolated polynucleotide according to claim 15 comprising a 
sequence of nucleotides as shown in SEQ ID N0:4 from nucleotide 70 through 
nucleotide 387 or SEQ ID NO:30 from nucleotide 70 through nucleotide 387. 

17. The isolated polynucleotide according to claim 15 comprising a 
sequence of nucleotides as shown in SEQ ID N0:1 from nucleotide 125 through 
nucleotide 442. 

18. The isolated polynucleotide according to claim 15 wherein said 
polynucleotide is from 318 to 1000 nucleotides in length. 
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19. The isolated polynucleotide according to claim 15 wherein said 
polynucleotide is DNA. 

20. An expression vector comprising the following operabfy linked 

elements: 

a transcription promoter 

a DNA segment encoding a polypeptide according to any of claims 2-8; 

and 

a transcription terminator. 

21. The expression vector according to claim 20 wherein said DNA 
segment further encodes a secretory peptide operably linked to said polypeptide. 

22. The expression vector according to claim 21 wherein said DNA 
segment encodes residue -23 through residue 106 of SEQ ID N0:2 or residue -23 
through residue 106 of SEQ ID NO:29, 

23. A cultured cell into which has been introduced an expression 
vector according to claim 20, wherein said cell expresses the polypeptide encoded 
by the DNA segment. 

24. A phamnaceutical composition comprising a polypeptide 
according to any of claims 2-8 in combination with a phamiaceuticaily acceptable 
vehicle. 

25. A method of producing a polypeptide comprising: 

culturing a cell into which has been introduced an expression vector 
according to claim 20, whereby said cell expresses the polypeptide encoded by the 
DNA segment; and 

recovering the expressed polypeptide. 

26. An antibody that specifically binds to an epitope of a polypeptide 
according to claim 2. 

27. A method for detecting a genetic abnorniaiity in a pati nt, 

comprising: 
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19. The isolated polynucleotide according to claim 15 wh rein said 
polynucleotide is DNA. 

20. An expression vector comprising the following operably linked 

elements: 

a transcription promoter; 

a DNA segment encoding a polypeptide according to any of claims 2-8; 

and 

a transcription terminator. 

21. The expression vector according to claim 20 wherein said DNA 
segment further encodes a secretory peptide operably linked to said polypeptide. 

22. The expression vector according to claim 21 wherein said DNA 
segment encodes residue -23 through residue 106 of SEQ ID N0:2 or residue -23 
through residue 106 of SEQ ID NO:29. 

23. A cultured cell into which has been introduced an expression 
vector according to claim 20, wherein said cell expresses the polypeptide encoded 
by the DNA segment. 

24. A phamnaceutical composition comprising a polypeptide 
according to any of claims 2-8 in combination with a phamiaceutically acceptable 
vehicle. 

25. A method of producing a polypeptide comprising: 

culturing a cell into which has been introduced an expression vector 
according to claim 20, whereby said cell expresses the polypeptide encoded by the 
DNA segment; and 

recovering the expressed polypeptide. 

26. An antibody that specifically binds to an epitope of a polypeptide 
according to claim 2. 

27. A method for detecting a genetic abnomiality in a patient. 

comprising: 
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1/3 - 

Hydrophobic . . Hydrophilic 

:3 -2 -1 0 1 2 3 
, .., , 

2 0.34 

3 0.16 

4 -0.03 

5 -0.31 

6 -0.77 

7 -1.27 

8 -1.60 

9 -1.83 

10 -1.78 

11 -1.83 

12 -1.62 

13 -1.48 

14 -1.25 

15 -0.45 

16 -0.28 

17 -0.55 

18 -0.47 

19 -0.18 

20 0.38 

21 -0.20 

22 -0.37 

23 -0.10 

24 -0.10 

25 -0.13 

26 -0.80 

27 -0.80 

28 -0.85 

29 -0.63 

30 -0.63 

31 -1.05 

32 -0.85 

33 -1.02 

34 -0.78 

35 -0.95 

36 -0.45 

37 0.02 

38 0.48 

39 1.23 

40 1.33 

41 1.58 

42 1.02 

43 0.80- 

44 0.33 

45 -0.17 

46 -0,15 

47 -0.23 

48 -0.42 

49 -0.33 

50 -0.33 

51 -0.42 

52 -0.63 
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53 -0.80 ,^==^^=^^^V 

54 -0.55 ======A 

55 -0.55 ======Q 

56 -0.75 ==: = = = = r:=A 

57 -0.17 ==C 

58 0 . 05 • V= 

59 0.35 G==== 

60 0.27 H=== 

61 -0.07 =C 

62 0.10 E= 

63 -0.35 ====S 

64 0.10 S= 
- 65 0,07 A= 

66 0.20 F== 

67 0.37 P==== 

68 0.07 S= 

69 -0.23 ==R 

70 -0.82 ========y 

71 -0.78 .========5 

72 -0.83 ========v 

73 -0.57 ======L 

74 0,23 V== 

75 0.40 A==== 

76 0.52 S=====: 
. 77 0.17 G==' " 

78 0.10 y= 

79 0.13 R= 

80 -0.62 ===:===H 

81 -0.48 =====N 

82 -0.48 =====1 

83 -0.35 =====T 

84 -0.45 =====3 

85 -1.07 ===========V 

86 -1.12 ===== = :r====S 

87 -1.12 ===========0 

88 -1.15 ============0 

89 -1,28 =============c 

90 -0.62 ======t 

91 0.45 1=====: 

92 0.50 S===== 

93 0.95 0========== 

94 0.70 L======= 

95 1.03 K========== 

96 0.23 K== 

97 -0.23 ==V 

98 -0.15 =K 

99 -0.90 =========:V 

100 -0.65 =======Q 

101 -0,63 ======L 

102 0.17 Q== 

103 0.63 C====== 

104 1,30 V= =========:=:== 

105 2.05 0= ====:=============== = 

106 1.75 5=======^===.======= 
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SEQUENCE LISTING ' 

<110> ZymoGenetics. Inc. 

<120> NOVEL CYSTINE KNOT PROTEIN AND MATERIALS 
AND METHODS FOR MAKING IT 

<130> 97-65PC 

<I50> US 09/023.570 
<I51> 1998-02-13 

<160> 49 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 746 
<212> DNA 

. <213> Homo sapiens •. .. 

<220> 

<221> CDS 

<222> (56) . . . (442) 

<221> s1g_pepticle 
<222> (56)... (127) 

<400> 1 

ccagcaggag gcacaggaaa actgcaagcc gctctgttcc tgggcctcgg aagtg atg 58 

Met 



cct atg gcg tec cct caa acc ctg gtc etc tat ctg ctg gtc ctg gca 106 

Pro Met Ala Ser Pro Gin Thr Leu Val Leu Tyr Leu Leu Val Leu Ala 
-20 -15 -10 

gtc act gaa gcc tgg ggc cag gag gca gtc ate cca ggc tgc cac ttg 154 

Val Thr Glu Ala Trp Gly Gin Glu Ala Val He Pro.Gly Cys His Leu 

-5 1 5 10 

cac ccc ttc aat gtg aca gtg cga agt gac cgc caa ggc acc tgc cag 202 

His Pro Phe Asn Val Thr Val Arg Ser Asp Arg Gin Gly Thr Cys Gin 

15 20 25 
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ggc tec cac gtg gca cag gcc tgt gtg ggc cac tgt gag tec age gee 250 
Gly Ser His Val Ala Gin Ala Cys Val Gly His Cys Glu Ser Ser Ala 
30 35 40 

ttc cct tct egg tae tet gtg etg gtg gee agt ggt tae cga eac aac 298 
Phe Pro Ser Arg Tyr Ser Val Leu Val Ala Ser Gly Tyr Arg His Asn 
45 50 55 

ate aec tec gtc tct cag tgc tge ace ate agt ggc etg aag aag gtc 346 
He Thr Ser Val Ser Gin Cys Cys Thr He Ser Gly Leu Lys Lys Val 
60 65 70 

aaa gta cag etg cag tgt gtg ggg age egg agg gag gag etc gag ate 394 
Lys Val Gin Leu Gin Cys Val Gly Ser Arg Arg Glu Glu Leu Glu He 
75 80 85 90 

ttc aeg gcc agg gcc tge cag tgt gac atg tgt cgc etc tct cgc tae 442 
Phe Thr Ala Arg Ala Cys Gin Cys Asp Met Cys Arg Leu Ser Arg Tyr 
95 100 105 

tag eccatcetct cccctccttc ctcccctggg tcacaggget tgacattctg 495 



gtgggggaaa cctgtgttea agattcaaaa actggaagga getceagcee tgatggttac 555 

ttgctatgga atttttttaa ataaggggag ggttgttcca getttgatcc tttgtaagat 615 

tttgtgactg teacctgaga agaggggagt ttctgcttet tecctgecte tgcctggccc 675 

ttctaaacca atetttcatc attttaette ectctttgee ettaccccta aataaagcaa 735 

gcagttcttg a 746 

<210> 2 

<211> 129 

<212>"PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> (1)...(23) 

<40G> 2 

Met Pro Met Ala Ser Pro Gin Thr Leu Val Leu Tyr Leu Leu Val Leu 

-20 -15 -10 

Ala Val Thr Glu Ala Trp Gly Gin Glu Ala Val He Pro Gly Cys His 
-5 1 5 
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Leu His Pro Phe Asn Val Thr Val Arg Ser Asp Arg Gin Gly Thr Cys 
10 15 20 25 

Gin Gly Ser His Val Ala Gin Ala Cys Val Gly His Cys Glu Ser Ser 

30 35 40 

Ala Phe Pro Ser Arg Tyr Ser Val Leu Val Ala Ser Gly Tyr Arg His 

45 50 55 

Asn He Thr Ser Val Ser Gin Cys Cys Thr lie Ser Gly Leu Lys Lys 

60 65 70 

Val Lys Val Gin Leu Gin Cys Val Gly Ser Arg Arg Glu Glu Leu Glu 

75 80 85 

He Phe Thr Ala Arg Ala Cys Gin Cys Asp Met Cys Arg Leu Ser Arg 

90 95 100 105 

Tyr 



<210> 3 
<211> 109 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> VARIANT 

<222> (2)... (26) 

<223> Xaa is any amino acid 

<221> VARIANT 
<222> (27)... (34) 

<223> Xaa is any amino acid or not present 

<221> VARIANT 

<222> (40)... (54) 

<223> Xaa is any amino acid 

<221> VARIANT 
<222> (55)... (72) 

<223> Xaa is any amino acid or not present 

<221> VARIANT 

<222> (74)... (93) 

<223> Xaa is any amino acid 

<221> VARIANT 
<222> (94)... (106) 

<223> Xaa is any amino acid or not present 



wo 99/41377 



PCT/US99/03104 



4 



<223> polypeptide motif 



<400> 3 
Cys Xaa Xaa Xaa Xaa 
1 5 
Xaa Xaa Xaa Xaa Xaa 
20 

Xaa Xaa Cys Xaa Gly 
35 

Xaa Xaa Xaa Xaa Xaa 
50 

Xaa Xaa Xaa Xaa Xaa 
65 

Xaa Xaa Xaa Xaa Xaa 
85 

Xaa Xaa Xaa Xaa Xaa 
100 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

10 15 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

25 30 
Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

40 45 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

55 60 
Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
70 75 80 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

90 95 
Xaa Xaa Xaa Xaa Xaa Cys Xaa Cys 
105 



<210> 4 
. <211> 387 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> sig_peptide 
<222> (1)...(69) 

<223> degenerate sequence 

<221> variation 

<222> ,(1)...(387) 

<223> n is any nucleotide 

<400> 4 

atgccnatgg cnwsnccnca racnytngtn ytntayytny tngtnytngc ngtnacngar 60 

gcntggggnc argargcngt nathccnggn tgycayytnc ayccnttyaa ygtnacngtn 120 

mgnwsngaym gncarggnac ntgycarggn wsncaygtng cncargcntg ygtnggncay 180 

tgygarwsnw sngcnttycc nwsnmgntay wsngtnytng tngcnwsngg ntaymgncay 240 

aayathacnw sngtnwsnca rtgytgyacn athwsnggny tnaaraargt naargtncar 300 

ytncartgyg tnggnwsnmg nmgngargar ytngaratht tyacngcnrag ngcntgycar 360 

tgygayatgt gymgnytnws nmgntay 387 

<210> 5 
<211> 17 
<212> ONA 
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<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 

<221> variation 

<222> (1)...(17) 

<223> n is any nucleotide 

<400> 5 
cntgygtngg ncaytgy 

<210> 6 
<211> 17 
<212> ONA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 

<221> variation 

<222> (!)...( 17) 

<223> n is any nucleotide 

<400> 6 
nntgydnngg nbvntgy 

<210> 7 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 

<221> variation 

<222> (1). ..(17) 

<223> n is any nucleotide 

<400> 7 
nnacrhnncc nvbnacr 

<210> 8 
<211> 17 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 

<221> variation 

<222> (1)...(17) 

<223> n is any nucleotide 

<400> 8 
mgngcntgyc artgyga 

<2]0> 9 
<211> 17 
<212> ONA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 

<221> variation 

<222> (1)...(17) 

<223> n is any nucleotide 

<400> 9 
nnnnvntgyv rntgydv 

<210> 10 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 

<221> variation 

<222> (1)...(17) 

<223> n is any nucleotide 



<400> 10 

nnnnbnacrb ynacrhb 17 

<210> 11 
<211> 17 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 

<221> variation 

<222> (1).. .(17) 

<223> n is any nucleotide 

<400> 11 

tgycartgyg ayatgtg 17 

<210> 12 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

.<223> oligonucleotide primer 

<221> variation 

<222> (1)...(17) 

<223> n is any nucleotide 

<400> 12 

tgycantgyg anrwrtg 17 

<210> 13 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 

<221> variation 

<222> (1)...(17) 

<223> n is any nucleotide 

<400> 13 

acrgtnacrc tnywyac 17 

<210> 14 
<211> 17 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 

<221> variation 

<222> (1)...(17) 

<223> n is any nucleotide 

<400> 14 

cntgygtngg ncaytgy 2; 

<210> 15 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

. <223> oligonucleotide primer 

<221> variation 

<222> (1)...(17) 

<223> n is any nucleotide 

<400> 15 

sntgygwngg ncaytgy 17 

<210> 16 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 

<221> variation 

<222> (1)...(17) 

<223> n is any nucleotide 

<400> 16 

snacrcwncc ngtracr 17 

<210> 17 
<211> 17 
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<212> DMA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 

<221> variation 

<222> (1).. .(17) 

<223> n is any nucleotide 

<400> 17 
wsncartgyt gyacnat 

<210> 18 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

.<223> oligonucleotide primer 

<221> variation 

<222> (1).. .(17) 

<223> n is any nucleotide 

<400> 18 
wsncantgyt gymsnmy 

<210> 19 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 

<221> variation 

<222> (1)...(17) 

<223> n is any nucleotide 

<400> 19 
wsngtnacra crksnkr 

<210> 20 
<211> 17 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 

<221> variation 

<222> (!)...( 17) 

<223> n is any nucleotide 

<400> 20 

cayccnttya aygtnac 17 

<210> 21 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

.<223> oligonucleotide primer 

<221> variation 

<222> (1)...(17) 

<223> n is any nucleotide 

<400> 21 

mrncmnywyw aygtnrm 17 

<210> 22 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 

<221> variation 

<222> (!)...( 17) 

<223> n is any nucleotide 

<400> 22 

kyngknrwrw trcanyk 17 

<210> 23 
<211> 48 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 
<400> 23 

gtctgggttc gctactcgag gcggccgcta tttttttttt tttttttt 48 

<210> 24 
<211> 472 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> expressed sequence tag 
<400> 24 

ccagcaggag gcacaggaaa actgcaagcc gctctgttcc tgggcctcgg aagtgatgcc 60 
tatggcgtcc cctcaaaccc tggtcctcta tctgctggtc ctggcagtca ctgaagcctg 120 
gggccaggag gcagtcatcc caggctgcca cttgcacccc ttcaatgtga cagtgcgaag 180 
tgaccgccaa ggncacctgc cagggctccc acgtggcaca ggcctgtgtg ggccactgtg 240 
agtccagcgc cttcccttct cggtactctg tgctggtggc cagtggttac cgacacaaca 300 
tcacctccgt ctctcagtgc tgcaccatca gtggcctgaa gaagtcaaag tacagctgca 360 
gtgtgtgggg agccggaggg aggagtcgag atcttcaggc cagggctgcc atgtgacatg 420 
tgtcgcctct ctcgctacta gcccatcctc tcccctcctt cctcccctgg gg 472 

<210> 25 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 
<400> 25 

ccagcacaga gtaccgagaa gg 22 

<210> 26 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 
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<400> 26 

tggtcctggc agtcactgaa gc 22 

<210> 27 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 
<400> 27 

ggcctgccag tgtgacat 18 

<210> 28 
<211> 18 
<212> DNA 

<213> Artificial Sequence . 
<220> 

<223> oligonucleotide primer 
<400> 28 

cccccaccag aatgtcaa 18 

<210> 29 
<211> 129 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> SIGNAL 
<222> (1)...(23) 

<223> engineered variant 

<400> 29 

Met Pro Met Ala Ser Pro Gin Thr Leu Val Leu Tyr Leu Leu Val Leu 

-20 -15 -10 

Ala Val Thr Glu Ala Trp Gly Gin Glu Ala Val He Pro Gly Cys His 

-5 . 1 5 

Leu His Pro Phe Asn Val Thr Val Arg Ser Asp Arg Gin Gly Thr Cys 
10 15 20 25 
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Gin Gly Sen His Val Ala Gin Ala Cys Val Gly His Cys Glu Ser Ser 

30 35 40 

Ala Phe Pro Ser Arg Tyr Ser Val Leu Val Ala Ser Gly Tyr Arg His 

45 50 55 

Asn He Thr Ser Val Ser Gin Cys Cys Thr He Ser Gly Leu Lys Lys 

60 65 70 

Val Lys Val Gin Leu Gin Cys Val Gly Ser Arg Arg Glu Glu Leu Glu 

75 80 85 

He Phe Thr Ala Arg Ser Cys Gin Cys Asp Met Cys Arg Leu Ser Arg 
90 95 100 105 

Tyr 



<210> 30 
<211> 387 
<212> DNA 

<213> Artificial Sequence 
<220> 

. <223> degenerate sequence ■. .. 

<221> variation 

<222> (1)...(387) 

<223> n is any nucleotide 

<400> 30 

atgccnatgg cnwsnccnca racnytngtn ytntayytny tngtnytngc ngtnacngar . 60 

gcntggggnc argargcngt nathccnggn tgycayytnc ayccnttyaa ygtnacngtn 120 

mgnwsngaym gncarggnac ntgycarggn wsncaygtng cncargcntg ygtnggncay 180 

tgygarwsnw sngcnttycc nwsnmgntay wsngtnytng tngcnwsngg ntaymgncay 240 

aayathacnw sngtnwsnca rtgytgyacn athwsnggny tnaaraargt naargtncar 300 

ytncartgyg tnggnwsnmg nmgngargar ytngaratht tyacngcnmg nwsntgycar 360 

tgygayatgt gymgnytnws nmgntay 387 

<210> 31 
<211> 589 
<212> DNA 
<213> Mus musculus 

<220> 

<221> CDS 

<222> (36)... (422) 



<400> 31 
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ggagtcttca gttgctgttg gactgtcctt tgcag atg ccc atg gca cca cga 53 

Met Pro Met Ala Pro Arg 
1 5 



gtc ttg etc ctt tgc ctg ctg ggc ctg gca gtc act gaa ggg cat age 
vai Leu Leu Leu Cys Leu Leu Gly Leu Ala Val Thr Glu Gly His Ser 
10 15 20 



101 



cca gag aca gcc ate cca ggc tgc cac ttg cac ccc ttc aat gtg aeg 149 
Pro Glu Thr Ala He Pro Gly Cys His Leu His Pro Phe Asn Val Thr 
25 30 35 

gtg cgc agt gat cgc etc ggc act tgc cag ggc tec cac gtg gca cag 197 
Val Arg Ser Asp Arg Leu Gly Thr Cys Gin Gly Ser His Val Ala Gin 
40 45 50 

gcc tgt gta gga cac tgt gag tct agt get ttc cet tec egg tac tet 245 
Ala Cys Val Gly His Cys Glu Ser Ser Ala Phe Pro Ser Arg Tyr Ser 
55 60 65 70 

gtg ctg gtg gcc agt ggc tat egg cac aac ate ace tet tec tec cag 293 
Val Leu Val Ala Ser Gly Tyr Arg His Asn He Thr Ser Ser Ser Gin 
75 80 85 

tgc tgc ace ate age age etc aga aag gtg agg gtg tgg ctg cag tgc 341 
Cys Cys Thr He Ser Ser Leu Arg Lys Val Arg Val Trp Leu Gin Cys 
90 95 100 

gtg ggg aac cag egt ggg gag ctt gag ate ttt act gca agg gee tgc 389 
Val Gly Asn Gin Arg Gly Glu Leu Glu He Phe Thr Ala Arg Ala Cys 
105 no 115 

cag tgt gat atg tgc cgt ttc tee cgc tac tag teccegaagg geteaggctc 442 
Gin Cys Asp Met Cys Arg Phe Ser Arg Tyr * 
120 125 

eggtcctgee aetgacatgt catgggtatc teaaaetcgg ggctctgace etctttatcg 502 
tctgtgaaga tgaggttggc ectctcagea gteteettgc tacattette ettegctcct 562 
gtccteaata aageaageaa tgcttga 589 

<210> 32 
<211> 128 
<212> PRT 
<213> Mus museulus 
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<400> 32 

Met Pro Met Ala Pro Arg Val Leu Leu Leu Cys Leu Leu Gly Leu Ala 
1.5 10 15 

Val Thr Glu Gly His Ser Pro Glu Thr Ala He Pro Gly Cys His Leu 

20 25 30 

His Pro Phe Asn Val Thr Val Arg Ser Asp Arg Leu Gly Thr Cys Gin 

35 40 45 

Gly Ser His Val Ala Gin Ala Cys Val Gly His Cys Glu Ser Ser Ala 

50 55 60 

Phe Pro Ser Arg Tyr Ser Val Leu Val Ala Ser Gly Tyr Arg His Asn 
65 70 75 80 

He Thr Ser Ser Ser Gin Cys Cys Thr He Ser Ser Leu Arg Lys Val 

85 90 95 

Arg Val Trp Leu Gin Cys Val Gly Asn Gin Arg Gly Glu Leu Glu He 

100 105 110 

Phe Thr Ala Arg Ala Cys Gin Cys Asp Met Cys Arg Phe Ser Arg Tyr 
115 120 125 

<210> 33 
.<211> 47 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 
<400> 33 

gtcggtgctc agcattcact actcgagggt tttttttttt ttttttt 47 

<210> 34 
<211> 18 
<212> ONA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 
<400> 34 

cctgggcctc ggaagtga 18 

<210> 35 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> oligonucleotide primer 
<400> 35 

aaactcccct cttctcag 18 

<210> 36 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 
<400> 36 

cgtaatacga ctcactatag ggcgaattgg 30 

<210> 37 
<211> 27 
.<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 
<400> 37 

gaaacagcta tgaccatgat tacgcca 27 

<210> 38 

<211> 767 

<212> DNA 

<213> Rattus norvegicus 

<220> 
<221> CDS 

<222> (203)... (595) 
<400> 38 

gcaggaggca cctggagtct acagttcctg ccggactgag tagctgaggc aaggaagcaa 60 
gcaccccaca cattcccacc caaggcagag aggatcaaca gtgccaccca ggcacacctc 120 
acagtcggaa gacccagaag cctggcttgc tgggggagag acacaactgc aaagacttcc 180 
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cttcccaccc actccttttc ag atg ccc atg gca cct cga gtc ttg etc ttc 232 

Met Pro Met Ala Pro Arg Val Leu Leu Phe 
1 5 10 



tqc ctq ctq 


aat 


ctg 


gca 


ate 


act 


gaa 


aaa cat 

yyy i> 




uLy 


aaa 
yay 


ara 


nrr 




Cvs Leu Leu 


Glv 


Lpu 


Ala 


Val 


Thr 
1 1 1 1 


Glu 

VJ 1 LI 


Glv Hi<; 


Glv 


I PI i 


Gin 


Ala 
M I a 


Ala 








15 










20 








25 






Qtc cca ate 


eea 


aac 

33^ 


tge 




tta 


car 


ccr tff 




y ty 






raa 
^yo 




Val Pro lie 

*ui iiv/ iic^ 


Pro 


Glv 


Cvs 


His 




n 1 J 


Pro Php 




Val 


Thr 
till 


Val 


Ml y 






30 










35 








40 








aot aat cac 

V4^V« ^^UW 


cat 


aac 




tar 


\«uy 




trr rat 


ata 


ara 
y 


raa 


ara 

y^y 


tat 

Ly L. 


Of Q 


Ser AsD Aro 


His 


Glv 


Thr 


Cvs 


Gin 


Glv 


Ser His 


Val 


Ala 


Gin 


Ala 


fvc: 




45 










50 








55 










qta qoa cac 


tgt 


aaa 

3"3 


tet 


agt 


act 

yuL 


ttc 


cct tct 


caa 

v#yy 


tac 


tct 


ata 
y uy 


rta 


424 


Val Glv His 


Cvs 


Glu 






Ala 


Phe 


ppn ^pr 


Ara 


Tvr 
\y\ 


^pr 


Val 

V CI 1 


1 PI J 




60 








65 








70 












gtt qcc aat 

5**** JJ%»W 


aac 

33^ 


tat 


cga 


cac 


aac 


ate 


ace tct 


gtc 


tct 


cag 


tge 


tgt 


472 


Val Ala Ser 


Glv 


Tyr 


Arq 


His 


Asn 


He 


Thr Ser 


Val 


Ser 


Gin 


Cys 


Cys 




75 






80 








85 










90 




acc ate age 


age 


ctt 


aaa 


aag 


gtg 


agg 


gtg tgg 


ctg 


cac 


tge 


gtg 


ggg 


520 


Thr He Ser 


Ser 


Leu 


Lys 


Lys 


Val 


Arg 


Val Trp 


Leu 


His 


Cys 


Val 


Gly 








95 










100 








105 






aac cag cgt 


ggg 


gag 


etc 


gag 


ate 


ttc 


aeg get 


agg 


gcc 


tge 


cag 


tgt 


568 


Asn Gin Arg 


Gly 


Glu 


Leu 


Glu 


He 


Phe 


Thr Ala 


Arg 


Ala 


Cys 


Gin 


Cys 






110 










115 








120 








gat atg tge 


cgt 


etc 


tee 


cgc 


tac 


tag 


gecccgaagg getcaggect 




615 


Asp Met Cys 


Arg 


Leu 


Ser 


Arg 


Tyr 



















125 130 

ccagtcctgc cactgatagg tcgtgcttct ctcagaccag ccctctttgg agtctgaaga 675 
tggggcttcg cctctgttta cctggcctcc tcagcagtct cactgctgct ttctccttca 735 
cccctgtcct caataaagca ggcagtgctt ga 767 

<210> 39 
<211> 130 
<212> PRT 

<213> Rattus norvegicus 
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<400> 39 

Met Pro Met Ala Pro Arg Val Leu Leu Phe Cys Leu Leu Gly Leu Ala 
15 10 15 

Val Thr Glu Gly His Gly Leu Glu Ala Ala Val Pro He Pro Gly Cys 

20 25 30 

His Leu His Pro Phe Asn Val Thr Val Arg Ser Asp Arg His Gly Thr 

35 40 45 

Cys Gin Gly Ser His Val Ala Gin Ala Cys Val Gly His Cys Glu Ser 

50 55 60 

Ser Ala Phe Pro Ser Arg Tyr Ser Val Leu Val Ala Ser Gly Tyr Arg 
65 70 75 80 

His Asn He Thr Ser Val Ser Gin Cys Cys Thr He Ser Ser Leu Lys 

85 90 95 

Lys Val Arg Val Trp Leu His Cys Val Gly Asn Gin Arg Gly Glu Leu 

100 105 110 

Glu He Phe Thr Ala Arg Ala Cys Gin Cys Asp Met Cys Arg Leu Ser 
115 120 125 

Arg Tyr 
130 

<210> 40 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 
<400> 40 

ccgtttctcc cgctacta 18 

<210> 41 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 
<400> 41 

gggccaacct catcttca 18 

<210> 42 
<211> 32 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer (ZC17438) 
<400> 42 

gatcagggcc ggccaccatg cctatggcgt cc 

<210> 43 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer (ZC17439) 
<400> 43 

gatcagggcg cgccctagta gcgagagagg eg 

<210> 44 
■ <211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer (ZC17950) 
<400> 44 

cgtatcggcc ggccaccatg cccatggcac ca 

<210> 45 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer (ZC17951) 
<400> 45 

cgtacgggcg cgccctagta gcgggagaaa eg 

<210> 46 
<2n> 6 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Peptide tag 

<400> 46 
Glu Tyr Met Pro Met Glu 
1 5 

<210> 47 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide 

<400> 47 
Glu Tyr Met Pro Val Asp 

1 5. 

<210> 48 
<211> 21 
<212> ONA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer (ZC12700) 

<400> 48 
ggaggtctat ataagcagag c 

<210> 49 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer (ZC12742) 

<400> 49 
ttatgtttca ggttcagggg 
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